Flux Schnell is all about speed, and Flux Dev and Pro are for quality. I’m working on a new video that dives deep into these head-to-head comparisons, including how Schnell, Pro and Dev stack up against each other and MidJourney. Stay tuned, it’s going to be an exciting week! Which do you prefer?
Flux. I have it running in ComfyUI on my struggling 8GB 2070 Super with the Dev model. It does take some time to create an image, the one I just did took 150 seconds. I lost interest in Midjourney when each new iteration seemed to restrict the content I wanted and the characters looked nicer but more generic with the same prompts I had used previously (also couldn't be arsed with the subscription costs).
Midjourney is one of those things where no matter how many people compare it to others midjourney is still “years” ahead of everyone. And the personalize feature is disturbing.
👋 Dall-E 3 was always better than Midjourney for 90% of my usage. I haven't used Flux enough yet to say the same thing but right off the bat, I love open source and local install so right there, that's a big plus!
Thaaaaaats a huge exaggeration. 1) Used worst version from FLUX. 2) Most of times used just one result from it, but chosen one of four from MJ. 3) Used prompting style from MJ. How on Earth MJ will fail with this circumstances???? I have suspicion Samson is an advocate of MJ =)
yeap, also at 13:40 the flux king clearly has 5 fingers (the index finger partially hidden by the scepter) yet he says 4 for some reason, even drawing an arrow to point it out, and gives his preference to MJ as a result ... for me the "four fingers" king looks way better, and feels more true to the prompt
@@MeshJedi I thought the people on the left were far less attractive and far less stylish. Yes they looked more modern, but they weren't warm and realistic, they were edgy and cartoonish. They literally looked AI generated. The couple on the left were holding hands but the fingers interclasped were odd, there's something wrong with his right eye, his stance isn't quite right and there were weird lines on the right side of his suit that made no sense.
Agreed, I was thinking the same thing. With the first image his judgement was completely vague, and then he lets a subjective opinion win. In reality, MJ only wins that first "cinematic" image if the cinematic in question were a 3D animated movie.
The censorship in Midjourney is now so exaggerated that every square centimetre of skin is immediately cancelled. A summer outfit? Not possible. A beach scene in swimwear? Not possible. At some point, all the models will be wearing burqas. Midjourney is no longer an option for me.
I am finding this too. I feel that explaining almost anything to do with a woman's body size or clothing gets an instant flag at this point. I slap the same prompts into the Bing one lol and it generates the image, not at the MJ quality, but it won't have an issue with it while MJ does.
I remember trying to create a portrait for my barbarian character in a DnD game and I got bonked once for asking for "prohibited" content 😅 In the end, I was able to produce a partially shirtless dude, but it was only by inherent associations of the model itself - I couldn't use any such keyword in the prompt, only by the nature of what barbarians in fur clothing and no armor look like, the model inserted the very little amount of nudity on its own. That was more than a year ago though, haven't been using Midjourney for a good while.
Well, I’ve generated thousands of fantasy images that are nudes, by teasing it out of the results instead of explicitly asking for it. You have to keep tweaking via inpaint mode about a hundred times and bouncing back and forth between 5.1 and 6 (because 5.2 will block the output even if you’re not even trying to get anything nude). But at some point (after about a year of doing this) I got a heavy-handed warning and the entire account froze up until I re-read their no-nudity FAQ and agreed to it. (I got busted!) So I read it and agreed and moved on with my other fantasy images. But in general I wish they did have a mode that allowed me to do what I want for my own purposes. I wasn’t even trying to get sex scenes, just natural scenes of fantastic human creatures, and it’s quite a fight sometimes. And since I give them $600+ a year, with privacy mode turned on, you’d think I’d have that luxury. But no.
I don't know if something changed, but I went to explore and rank images this week, and some of the pics were showing tons of skin, and there were even some topless women. I was surprised, because I thought they were avoiding nudity as part of their brand
His comparison between MJ and Dall-E 3 is also very biased towards MJ. I stopped watching after the "Pixar"-prompt... th-cam.com/video/AXv5sgIoPnc/w-d-xo.html
Flux Schnell is substantially inferior to either Flux Dev or Flux Pro. It exists because it's fast (the word "schnell" means "fast" in German): it generates reasonably good images in only 4 steps. I'd be surprised if Schnell could beat Midjourney in many head-to-head comparisons. Flux Dev and Pro can easily match and often beat Midjourney.
@@karlpj1 Not surprised at all. It's meant to be run either through a cloud service like runpod or locally, and if you were actually using it you would run 10 images, not 1-4. Plus the prompting is all wrong for this model; it uses descriptive prompts like Dalle or Ideogram.
Midjourney has a 'look'. You know an image was generated with it as soon as you see it. Which in my opinion is not a good thing even if it is aesthetically "better".
I agree. That's exactly what I thought when I looked at the comparative images of the young woman at the beginning of the video. The MJ image was more dramatic, but also slightly less natural. If they asked ME which image was made by AI, I would've picked the MJ version.
I know what you mean... hehhe.... just remember david kicked goliaths ass? And personally, even in this vid, flux managed to smash goliath for me at least.
@@KOSMIKFEADRECORDS So it is written... 😂. I was a born again christian for 31 years. The bible is not the Word of God and it is historically hogwash but it contains spiritual secrets that are mostly unlocked by spiritualists rather than the religious.
@@truepilgrimm Sorry to hear that. But you are dead wrong. Pun intended. The Bible IS the Word of God whether you like it or not or agree with it or not.
Yep. Came in here to mention this! Also, it's possible that the first "four finger" image has the so-called missing finger holding the left side of the camera. Also, the Flux King had a nice crown while the MJ one did not...
@Didi_Meow_Records It's not thinner, it's just partially behind what the hand is holding. All of the hand shots from Flux looked done right. There is a difference between not visible in the scene and not physically there. I was able to create similar looks with my hand. He is definitely biased to MJ.
In comparing Flux and Midjourney, Flux seems more grounded in reality for image generation. My experience in architecture makes the skyscraper comparison particularly revealing. The Midjourney building resembles a Christmas tree, lacking the structural integrity of a real building, specially with the floor levels. In contrast, Flux renders the floor levels and overall design more clearly.
Midjourney certainly has no structure when it comes to buildings, with some exceptions. When doing panos it's a bit better, I guess because it copies references that show more of the buildings as a whole. In everything else it's structurally messy, but artistically inspiring.
I agree. It’s far more accurate to reality. Its training data is more reliable in my opinion. I thought the couple shown side by side were people I’d like to meet, whereas the MJ couple would prejudge me, and their expressions were cold and aloof.
14:30 seriously, the one on the right looks far more stylish and suave; the one on the left just looks scruffy. Also hate the fringe on the left. The left are just scruffy hippies with blue hair.
I was planning to say the same thing. When has it become that junkie/homeless looking people are stylish now? These days you’re more original and stylish if you’re just yourself, clean and neat.
I can't agree about the couples, though maybe it's because I'm a boomer. The couple on the left look vacant and unaware of their surroundings, and slavishly follow today's conventions--tattoos, of course, though thankfully no piercings or nose rings. The couple on the right seem not only to be conscious, but actually enjoying the situation. I can't say their look is timeless, but I think, or maybe just hope, that the fashion adopted by the left hand couple disappears quickly.
@@ElHongoVerde nah, it's just that "visual quality" and "style" are quite subjective, and he is probably used to MJ more, and values this style more which is why he chose MJ over SD in the first place.
Jolly kings crown is spelled wrong. lol. And flux still got it. The flux image has 5 fingers, one goes up the sword hilt. Same with the 4 fingered camera shot.
Regarding the couple you gave to MJ, Flux did give you exactly what you wanted. And regarding the king with open arms, he did have 5 fingers. The hand shown with 4 fingers has a fifth finger going upwards against the back of the handle, you can actually see it... seriously gotta look at the details. Fair judgement means not favoring one over the other because it has the funnier pic, but judging by how much of what you've written it actually gives you.
@@silverstreetman I used this tutorial for my Comfy UI setup and Comfy UI Manager + Stability Matrix: th-cam.com/video/aPQ8gvTNCKM/w-d-xo.htmlsi=uWe2KThFMW5fgWHJ. So results may vary. There was one link in the description that was busted, but if you match the file names/file sizes you should be able to deduce what you will need. Hope that helps!
So what you're making is a thought provoking take in Matrix? Because I think if they did a really serious and thought and emotion take in Matrix, it would have been 10x the movie it is now. Dodging bullets and action is cool. But one that really makes you think about being in an emulation, and really digs deep down, and makes the viewer wonder, would have been a much better film.
If I pay for it, I use it as I want. But my paid AI video generator (of another brand) also allows me to do that, and I have done several versions and it's sure fun. 😃👍
14:45 you mentioned that you preferred the image that MJ generated, but in the end, Flux followed the prompt faithfully, while MJ didn't. I don't think you noticed it, but you asked for a Yoda motif on the t-shirt, yet MJ generated an R2D2 image on the t-shirt.
On the first image, Flux gave you something truly cinematic, where MJ gave you more of a pretty photo. As far as I'm concerned, Flux followed the prompt much more closely as a result.
Was about to say the same, he kinda lost me there completely. Plus, this is absolutely subjective who is "more attractive", at least where I live, I would probably risk a beating going out like this.
Right? The man on the left also has zero delineation between the undershirt and the pants, it's just one long seam like he's wearing some sort of funky onesie under the suit jacket. The one on the right didn't really get the spiky hair bit well, but it got all other prompts, and the woman on the left has the wrong iconography entirely. Sure MJ pretty consistently has more details that were not part of the prompt, but that's not necessarily a good thing if you're trying to control the output and it veers into an unwanted direction. The take on the stylization of the architecture was also clearly biased again, with him knocking the Flux image for being at 'an odd height' when the other image is the exact same in that regard, just with an additional pan upward. Plus the MJ one is so very stereotypically overdone as cyberpunk, stretched beyond even its norm, with large swaths of the building being plastered in screens where it isn't coherent, whereas the Flux one has more clean stylized buildings above the walkable visual height. Honestly, they both lost in this one to me. One was way overdone, the other could have used more detailing but it at least looked like a proper building, but his bias for MJ clearly shows time and again. Then of course, there was the 'slightly misinterpreted' astronaut picture of MJ's that completely failed at the prompt for the astronaut to be hatching from the egg, and how he only allowed 2 shots from the hand comparison for Flux to MJ's 4. He might as well have just started the video with 'I like MJ better regardless of results' then closed the video out.
Would have been more interesting, I think, to compare Midjourney with the Pro version. Sure it would be more expensive, but probably still cheaper than MJ and I bet the results would have been far superior. Anyway, still an interesting video and appreciated your examples.
Fantastic stuff - I'm a regular user of Freepik as it's less censored than Adobe, and with my fashion model photography work I'm able to successfully use Freepik with NSFW projects for retouch, expand and creative upscale all in one tool, whereas Photoshop is painful and rather frustrating to try to get working with any shot that shows more than a bare elbow. Additionally, Freepik just last week sneaked in the Flux model as part of their options and it's a great addition.
You're comparing Midjourney's best model to Flux's smallest and least-capable model. The fact that Flux schnell generates anything close to Midjourney is a win for Flux. If you compared the Flux Dev model, it would be about equivalent and Flux Pro wins out.
With such SFW demands they would need to shut down 90% of Hollywood's studios, and with them half the city as part of their outside infrastructure. These restrictions apply not only to Midjourney, but also to most online AI-graphics services. For example, they refused to depict the Madonna with a baby: child, partially nude, religious theme. That's why I balked at these AIs and bought an RTX 2060 (for a start). 8 GB VRAM and 32 GB RAM on an Intel Core i5 7600 is enough to run Flux.1 dev. It runs an order of magnitude slower than the regular models, but the quality is significantly higher and there are no restrictions on content. (Translated from Russian in DeepL)
15:42 -- i like the contrast and stark lines on the Flux city scene... MJ often adds TOO MUCH detail these days that clutters the image, unlike older MJ versions that are less "busy"... when i am compositing images together i don't want a bunch of extra "fake" detail that is just clutter... if that makes sense?
You can get around that a little with settings or prompts, but I know what you're saying, like it's trying too hard. But actually that is what I like MidJourney for, it's more artsy, more abstract and surreal, I feel like I'm creating art that doesn't exist. Now Flux better not go that way, because it's like earthbound dead realistic and amazing. So I use both, I hope MidJourney doesn't try to be Flux, I mean fix the body issues, but they better not lose the wild unrealistic artsy style.
@19:57 I'm using flux-dev and flux schnell on my computer in ComfyUI with a 4060 Ti 16GB. On dev with a simple scheduler with 20 steps for 1024x1024 it takes about 50 seconds per image. On schnell with a simple scheduler with 4 steps on a 1024x1024 it takes about 9 seconds per image. I'm new to AI image generation locally but I'm very excited about what can be done on a ($450 retail) / ($250 used) GPU.
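To put the timings quoted above side by side, here is a minimal Python sketch. It only uses the numbers from that comment (20 steps in about 50 seconds for dev, 4 steps in about 9 seconds for schnell, both at 1024x1024 on a 4060 Ti 16GB). The helper names are hypothetical, and dividing total time by step count is a simplifying assumption: it folds fixed costs such as text encoding and image decoding into the per-step figure, so treat that figure as an upper bound.

```python
# Timings as quoted in the comment above (RTX 4060 Ti 16GB, 1024x1024, ComfyUI).
# These are one user's rough measurements, not benchmarks.
runs = {
    "flux-dev": {"steps": 20, "seconds": 50},
    "flux-schnell": {"steps": 4, "seconds": 9},
}


def seconds_per_step(run):
    """Average wall-clock time per sampling step (includes fixed overhead)."""
    return run["seconds"] / run["steps"]


def images_per_hour(run):
    """How many images one hour of continuous generation would yield."""
    return 3600 / run["seconds"]


for name, run in runs.items():
    print(
        f"{name}: {seconds_per_step(run):.2f} s/step, "
        f"{images_per_hour(run):.0f} images/hour"
    )

# Per-image speedup of schnell over dev on this card.
speedup = runs["flux-dev"]["seconds"] / runs["flux-schnell"]["seconds"]
print(f"schnell is about {speedup:.1f}x faster per image")
```

On these numbers the per-step cost is nearly the same for both models (roughly 2.5 versus 2.25 seconds), so schnell's speed advantage comes almost entirely from needing 4 steps instead of 20.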
Same, i'm using Flux Schnell on 1650 4Gb 4 steps, takes 3 mins per image 640x360 but upscaling it to 1280x720, takes longer, but quality is still amazing.
I think they use the censorship as an excuse so they don't have to "work" all the data. IMO. They could easily make a section for adults only. All of them could. There are "R" rated movies and scenes in the real world. Are they serious about creating art or not?
I hope so. Just take off the guardrails for those that pay and sign off on a section saying we understand. Also, we should be able to flag something we feel is over the edge. I mean, we don't want any pictures of violence against real living people, or nudity of real people; they can keep logs and kick off offenders after a warning, and we too can flag stuff we make that goes off the rails. I had a few results that were like, ummm, I didn't ask for that, so with the guardrails off the AI can do some really bad things. We should be able to flag and delete, and they should have ways to monitor, but you know it's not easy, so I kinda understand. I guess it would work if everyone was good about it, but just look at Discord and what Dalle3 did initially: a few people ruined it for us. People tend to go too far for shock value, probably kids, and that's understandable too. But yeah, they make movies like SAW that are crazy sick. I can't even watch, but I don't mind, I just don't watch. But we make an image with a person tied down at gunpoint for a crime comic cover or crime book and that's bad? I try to do comic covers like the ones from the 50's pre-code horror and get BLOCKED. Open source will fix that, but I want Dalle3 and MidJourney to add R sections.
Before, I could at least understand it when it was using Discord, but now that they have a version that runs off its own site, it's increasingly strange to censor things to this extent. I guess the only reason they censor anything is to avoid getting into hot water regarding political figures, CP, and whatnot. But the idea that they cover women to the extent that they do is still baffling. I'm not even prompting for nudes, but it still refuses to show leg and shoulders a lot of the time.
Nothing is more frustrating than creating a detailed prompt, only to have Midjourney tell me that SOMETHING is banned without telling me what it is. Then going through every word trying to figure out what could potentially be objectionable to MJ, even though it all seems fairly innocent.
I prefer " variety " to the more contemporary and often politicized term "diversity." To me, "variety" captures the essence of differences and distinctions more neutrally and inclusively. It emphasizes a broad spectrum of elements and experiences without the potential ideological connotations that "diversity" might carry in current discourse. By focusing on "variety," we can appreciate the richness of different perspectives and contributions without the baggage that sometimes accompanies the term "diversity." This choice of words helps foster a more open and straightforward dialogue about differences and their value to our interactions and environments.
"Diverse" also somewhat suggests / connotes "incompatible", or at least I think it has some of that baggage in common usage. "Variety" is not a bad synonym...
@@excelerator well thank you very much for this wonderful compliment. It really makes a person's day when somebody else acknowledges another person's observation.
Thanks for a very subjective review of both, I do however find the Flux characters to be more photo-realistic. The first one of the girl, as soon as the MJ one came up, even though the lighting is superb and you could even see the tiny hairs under her chin, something about the skin just screamed "computer generated" to me (whether that was because it was too perfect and didn't have any imperfections in it?). She looked like a well modeled game character, whereas with the Flux one, whilst being 'Scandinavian noir' as you put it, the skin tone and texture looked like she was real. The same with the couples, again the MJ ones looked like well modeled game characters, and even though the rest might have looked funny, the Flux characters' skin tones and features looked more realistic to me.
No that's not how I see it with those couples in the images, The MJ version looks like two left wing hippies, and the couple on the right look like conservatives
Fascinating - you think the couple on the left looks "much more attractive"? They look gross to me lol, girl caked in makeup, both covered in tattoos, guy's hair looks dirty and messy, and not a fan of the guy's outfit at all. I'd give one point to the left girl's shirt over the one on the right but that's about it from my perspective. From a technical standpoint though, the pic on the left is more photo-realistic in my opinion.
You've taken the most basic option available in FLUX and pitted it against the best of what Midjourney offers! Not only that, some of your conclusions are fundamentally flawed, especially with regard to numbers of fingers. It appears you count fingers as well as Midjourney does!
I train my own models on my own art and photography with flux and the sheer POWER of piloting a 16 channel pipeline like this is.. difficult to articulate. It's also free. Of course, my hardware wasn't, but all things considered; I don't think an RTX 4090 or the odd lease of an H100 cluster for larger trainings is all that much when you consider the sheer gravity of what you're getting in return. This stuff really is science-fiction-good. It's blowing my mind daily.
Thank you Samson, I just gave Flux Schnell a go for the things I use it for (elements for book publishing), and it's the ONLY text-to-image AI tool that rivals Midjourney for me. Really impressed with some of the outputs. I like Midjourney for some things but this will become a regular too.
The couple on the left look like they migrated from a village in a developing country and are showcasing themselves as "cool" and "style" while looking like people who were raised on TikTok. On the right: more or less normally clothed people with style (except for the painted hair).
In both cases where you say FLUX only shows 4 fingers, I can easily see that there are 5 fingers, but one of them is not clearly visible because of the position of the hand. At t=50, the little finger appears to be wrapped around the camera the hand is holding. And at t=13:11, the right-hand index finger is clearly visible behind the staff. (You're paying $30/month for Midjourney and nothing for FLUX Schnell.)
@@nexusknight7 Don't know for sure, but I think the reason is that they still want people to join the community through the Discord. If somebody has a problem with anything, they ask through the Discord, and they still want to expand the community there for reasons (idk), so the 100 images condition is just because of that. On the MJ office hours they said that soon you won't need Discord at all and they will open it up for more people, so that's it.
Midjourney appears to be generating a more "atmospheric" picture, but it is compared against the weakest Flux, so I would expect some disparity. With the couple picture, the one on the left was more stylistically consistent, but the woman on the right was more natural, even if the man was a bit "painted" in clothing, and not so much paired in style.
I'm running dev on my desktop, works great! And oh, it's hard to forget it's FREE! Besides the cost of the watts anyway, but way less money and less restricted than other solutions.
You totally forgot to mention Stable Diffusion with its LoRAs. You can generate custom content that is 100% impossible with all the other tools (right now). Imagine your kids draw their own character in their own style. You can make a LoRA of it and let it go on an adventure =)
The problem with Midjourney is that it is too censored: you cannot create an erotic scene, even if it isn’t really erotic, and you cannot create a war scene with blood. And finally the price is quite high for something with so many restrictions. They need to do something about it before they lose the battle against other systems like Flux, Fooocus, etc.
For me it feels like MJ just has its own distinguished style while flux needs a little bit more guidance. And the comparison feels very biased because you know how to prompt for MJ to get the results you like and apply these prompts to Flux. So in the end you just find out how well your MJ prompts work in Flux and are not really comparing the two models.. But I understand the lifecycle of YT doesn't allow for waiting until we figured out how Flux prompting works ^^
I'm going to have to give the win to Flux here. My prompt "Space Sheriff T-Rex holding a Winchester rifle" gave me a dinosaur holding a rifle (granted, it was a shotgun), but that's still better than anything SFW/violence-censored, which at best gave me a dinosaur holding a parasol.
16:46 this is always a flaw in people's criticisms and/or reviews of AI. As you have stated, this is a German company, and if you look at the man and woman that it has created, they are very German in their look. These are biases that are put into every AI, whether it is an image generator or a chatbot. There is always a bias left behind by the programmers and those that engineer chatbots.
Well... for the "censorship" part I found out that Flux (using it locally with ComfyUI) is ok with boobs and butts but doesn't want to draw the rest... so it isn't really uncensored; not totally at least.
13:46 Midjourney (V6.1) adds a woke bias to the prompt and got the following wrong: - age - adolescents instead of adults, - suit colour - gave lilac pink instead purple, - t-shirt graphic - gave R2D2 instead of Yoda. Flux (Schnell) follows the prompt more accurately.
Midjourney images always seem to have a sameness to them, oversaturated, like all Thomas Kinkade paintings... 14:50 ... the couple on the left is more attractive? They look like they shop at Goodwill! 🤣🤣🤣
In the 1st comparison, the Midjourney model seems to be from a game; it isn’t photorealistic. It is artistic for sure, but it has nothing to do with a photo taken with a digital camera on the street.
@@karlpj1 I said: imagine if Canva, who now owns Leonardo and Affinity, offers AI inside Affinity apps like Adobe does. I think you did not understand what I wrote.
Sites where you have to Log In, give your email or even your phone to get to play around isn't "Free". Make mention of that in your next videos. "You have to log in or make an account" etc. Giving your info is still a price to pay.
Most artists are actually taking data from movies, shows and other sources of art. We/they just don't realize it because we don't store that data like AI would, making it easily retrievable. I know of well known artists in all fields that are similar to other artists, but with their own unique style, and when you ask them what inspired them, they usually reveal the source. From there, you could find the similarities. Say the movie Pulp Fiction. Tarantino literally tells us his ideas came from the writer Elmore Leonard. And he's right. But should Elmore sue everybody that does similar things to his work? Who inspired him?
how is this shitty argument not dead yet :D even copying a style 1:1 takes years of dedication, skill and effort, while also being a conscious choice. A person can choose a picture as reference, but they can also IGNORE a picture. AI does not choose what it trains on, nor can it choose to ignore certain pieces. also, as any experienced artist will tell you, the biggest inspiration of an artist is the real world.^^ btw, we store data, AI doesn't. We can recall an image we have seen and, if we are good enough, recreate it 1:1. AI can't, because it does not store images, it only breaks down every image into a vector with much lower dimensionality, trying to extract the most important features, then updating its model weights accordingly.
@@thomasmann4536 We are a more "advanced" version. But you can train simple AI models, and draw a basic chicken. Then train the AI, like the artists have trained, to make it a different style. I'm an artist. I can draw. Just like AI I used a graph to get the 3D right. I even use my hand to make similar shapes of what I'm drawing as a reference. AI is just a "tool' It can't innovate like you or I can "yet". But, if you are an artist, use that tool. It can lessen the workload. Maybe learn how to train your own model just for you. It's pretty cool stuff. You can layer your models and advance your styles much faster.
@@insurancecasino5790 my friend, what you say isn't wrong, though also not fully right. btw, I'm a 3D animator but my academic background is in AI, I've studied NLP under Björn Ommer, one of the creators of Stable Diffusion, and the tech is always fascinating to me. I actually made my own models, and tinkered around with the tech. The biggest difference between using conventional tools and AI is this: you can get really good at Blender or Photoshop and then create EXACTLY what you want CONSISTENTLY. But no matter how good you get at using AI, you will never be able to generate exactly what you want consistently. It's just the inherent probabilistic nature of these models. Ya, you can use them as a base, or inspiration, which I often do. But in the end if you were to rely on AI entirely, you're just trading a skill limitation for a model limitation, which is one you can't overcome yourself.^^
@@thomasmann4536 Depends on what type of art. If your brush is random to see what might inspire you, AI is "perfect" for that inspiration. I generated a chicken with sunglasses. Welp, it put these giant sunglasses in front of a tiny chicken on a rock in a country style landscape. So, I asked myself, who do those glasses belong to? A giant? Then it gave me the surreal idea of a giant cooking his chickens with what look like two magnifying glasses made into magnifying sunglasses. You don't realize what's going on until you look at the chicken, and notice it's actually cooking. I would have never come up with that without AI. And there's the surreal connection to "sun" glasses.
@@insurancecasino5790 thats an unfalsifiable statement. nobody can prove you wrong on this because it already happened and now you already have this idea in mind. You might have not gotten this idea had you not used AI. But you would have probably gotten a different one. Maybe a better one?
I've noticed that a lot of models will generate generic persons (without specifying race, age, sex, etc.) in a characteristic way because they've been trained on specific sets of provided images, possibly involving the hiring of models from the local region. Certain Stable Diffusion models (e.g., Deliberate V2) seem to favor Asian looking people, while Flux tends to generate people who look German. But they can be a little more creative and flexible with the right prompting. I definitely look forward to trying out LoRAs with Flux to see how well they help to steer Human characters towards a desired look.
In fact, in the architectural prompt, I really prefer Flux's result, because MJ's might look amazing, but it is a bluff in terms of realism and actual architectural sense. If you tried to 3D model MJ's result, you would suffer trying to build coherent shapes that replicate it. Whereas in the Flux result, the shapes are clear and defined, and it makes sense how the building is constructed. I would much rather take the inspiration from Flux than from MJ to achieve something "real".
With the advancements of software and hardware, self-hosted AI servers will become the norm in the near future. This will be an amazing time for creative people =)
I used all 3 Flux, and I can't believe you made a whole video with its lowest level to go up against MJ. I must admit tho you got some incredible shots from Schnell. Flux Pro should be your next video. Even Flux dev is much better than Schnell too
Flux is really amazing, being able to run it locally with 4 GB VRAM and 32 GB system RAM, it only takes about 3 minutes to produce very beautiful images. The NSFW does need a bit more development, but that's not what I'm interested in, I'm just happy I can run it locally and play around with it, oh yeah, and Flux is free. Rating the images you created: 1. Flux looked more realistic than MJ. 2. 4 tries vs 2 tries might not be very fair, but hands are difficult. 3. You saw the result, but Flux can mess it up, trust me. 4. The king from Flux does look more realistic and both hands do have 5 fingers, MJ didn't show hands so hard to fault Flux there. 5. The Flux couple were prettier imo, but the background for MJ was better. 6. MJ had the better image, but again Flux Schnell is their lowest quality model and still free vs paid MJ. 7. Both did well, Flux interpreted better but looks a bit 3D-model-like.
I would say that Midjourney is great for creating graphics or characters for video games, a little bit like Unreal Engine. We cannot say that the quality of the skin is photorealistic because it always seems a little bit synthetic. If you take a picture with a camera you will see the difference very easily, I think. But the artistic result is very good for sure, with great DOF, color range, etc. In the example of the happy king, we can see that it isn’t realistic. If you compare the skin of the Midjourney King with a photo of Theoden from the Lord of the Rings movie, you can see the difference easily. For an animation movie or a game introduction, it is perfect, very artistic, but I’m not sure that it would be good enough for a real movie like the Lord of the Rings, or maybe only for a very fast and short scene where we cannot see the difference.
as an architect the Flux skyscraper makes more sense on the realism aspect, also the scale and proportions are more believable, the MJ is not bad but is more focused on digital art examples that I do like, but are not as detailed and as believable, also the king prompt in flux had 5 fingers, but one was behind the sword.
Flux has good image quality, and it can do text and signage. However, Flux is also censored, and its generated human models still have six fingers and six toes per appendage. So not yet impressed. And I used the Pro. Let's see what happens in Pro.2
Midjourney was an extremely traumatic experience for someone so excited about AI art it was unreal. You can’t make anything even remotely creative with it. I mean truly creative. The censorship just plain cannot work. Your first clue is that god allows free creativity THEN selects in evolutionary processes. Your second clue is that every word processor, the printing press, etc. followed the same pattern without question, because true creativity by definition cannot be controlled or anticipated. You cannot remove words from a language because they might be misused. It's so stunningly 3rd Reich “degenerate art” law ridiculous that I was speechless 😶 Pun definitely intended 😂😂😂👁️
I was just comparing Flux Schnell and Flux Dev on my local machine and my 12gb RTX 3060 and 32gb ram handled both fairly well in comfyui. Schnell took about 40 seconds at 4 steps and Dev took about 120-140 seconds at 20 steps. The Dev images were much more detailed and realistic, but took longer. That's the trade-off, I guess.
@@gatwick127 You can sell your outputs. You just can't set up a service using the models or derivative models and charge for it. Here's the part of the license that talks about outputs: Outputs. We claim no ownership rights in and to the Outputs. You are solely responsible for the Outputs you generate and their subsequent uses in accordance with this License. You may use Output for any purpose (including for commercial purposes), except as expressly prohibited herein. You may not use the Output to train, fine-tune or distill a model that is competitive with the FLUX.1 [dev] Model.
flux 1000%. it's open source. anything open source gets an extra 20 pts to begin with. i feel there was shade on the first test towards flux and you rated it based on your preference rather than what it was told to do. keep in mind you're also using the lowest end model of flux. if you were gonna have it hosted on a server, you shoulda gone toe to toe with the pro to be fair, but i'm glad you showed off the consumer grade cause now i'm going to install it. thanks :D
Yup, it's very strict, I might jump ship unless they loosen up. You can't even do a normal woman in a bikini on the beach, or horror. I get some false positives too. But it's still crazy good.
The power of open source is that it constantly leaps forward because thousands of developers work on it, while closed source is limited to a handful of developers. FLUX is very promising, open source, uncensored, only a few weeks old and it's already taking the AI world by storm. I'm sure it will surpass all the other AI platforms, sooner or later. Oh, and open source is FREE.
Hey! Great video. Just a heads up, I noticed a few mouth noises that were a bit distracting. There are tools that can help clean up mouth noises and make your audio more polished (like iZotope mouth de-click, which it's very easy to use but I'm sure there are other ones).
I think this is simply incorrect at a foundational level. Obviously I'm not disputing your valid opinions on things, however, objectively, you asked Flux for something, and the question should be whether it provided what you asked for. I have used Midjourney extensively in the past and now run Flux Dev locally. The reasons I disagree at the top level are these. First, you compare the ‘worst’ Flux model against the best Midjourney, so it's not a like-for-like comparison. Then you openly state Midjourney produces 4 images and you pick one, while in most of these tests you give Flux 1 chance. You also state that both images meet the prompt but you prefer Midjourney's bright art style. Was that in the prompt? You state Flux will follow the prompt better, so you could say you aren't asking Flux for an art style you prefer, so it'll never win? On Flux Dev, I have many pictures of hands, not just human hands, and it fails perhaps 5% of the time (and that's being pessimistic) in not giving what I expect in terms of biology for said creature. I'd imagine this is even better on Pro. I'm obviously not intending to dismiss your opinion, MJ is beautiful too, I just disagree with the testing methodology you have come up with between these. It seems to me that you personally prefer MJ's output art style and that affects your objectivity on this.
amazing video, you got a new sub. but I personally prefer Flux's color palette on many of the images. MJ did really good though, more so on the city cyberpunk style image, loved the MJ details on the buildings. but that is just me. lol
Flux Schnell is all about speed, and Flux Dev and Pro are for quality. I’m working on a new video that dives deep into these head-to-head comparisons, including how Schnell, Pro and Dev stack up against each other and MidJourney. Stay tuned, it’s going to be an exciting week!
Which do you prefer?
Yes
@@huwhitememes Same
Flux. I have it running in ComfyUI on my struggling 8GB 2070 Super with the Dev model. It does take some time to create an image, the one I just did took 150 seconds. I lost interest in Midjourney when each new iteration seemed to restrict the content I wanted and the characters looked nicer but more generic with the same prompts I had used previously (also couldn't be arsed with the subscription costs).
Midjourney is one of those things where no matter how many people compare it to others midjourney is still “years” ahead of everyone. And the personalize feature is disturbing.
👋
Dall-E 3 was always better than Midjourney for 90% of my usage.
I haven't used Flux enough yet to say the same thing but right off the bat, I love open source and local install so right there, that's a big plus!
As a human artist I am never censored, 15 years of experience of drawing boobs.
Boobs are the best invention ever
Man of culture.
I can think of no finer use of your talents. 😊
So, you draw one-handed?
@@ClockworkGearhead of course
Thaaaaaats a huge exaggeration.
1) Used worst version from FLUX.
2) Most of times used just one result from it, but chosen one of four from MJ.
3) Used prompting style from MJ.
How on Earth would MJ fail under these circumstances????
I have suspicion Samson is an advocate of MJ =)
Agree : (((
yeap, also at 13:40 the flux king clearly has 5 fingers (the index finger partially hidden by the scepter) yet he says 4 for some reason, even drawing an arrow to point it out, and gives his preference to MJ as a result ... for me the "four fingers" king looks way better, and feels more true to the prompt
and the people in MJ are more attractive (are they?) and have better style (never requested in the prompt)
@@MeshJedi I thought the people on the left were far less attractive and far less stylish. Yes they looked more modern, but they weren't warm and realistic, they were edgy and cartoonish. They literally looked AI generated. The couple on the left were holding hands but the fingers interclasped were odd, there's something wrong with his right eye, his stance isn't quite right and there were weird lines on the right side of his suit that made no sense.
Agreed, I was thinking the same thing. With the first image, his first judgement was completely vague, and then the guy lets a subjective opinion win. In reality, on that first cinematic image MJ only wins if the cinematic was a 3D animated movie.
The censorship in Midjourney is now so exaggerated that every square centimetre of skin is immediately cancelled. A summer outfit? Not possible. A beach scene in swimwear? Not possible. At some point, all the models will be wearing burqas. Midjourney is no longer an option for me.
Middle East Journey 😂
I am finding this too. I feel like describing almost anything to do with a woman's body size or clothing gets an instant flag at this point. I slap the same prompts into the Bing one lol and it generates the image, not at MJ quality, but it won't have an issue with it while MJ does.
I remember trying to create a portrait for my barbarian character in a DnD game and I got bonked once for asking for "prohibited" content 😅 In the end, I was able to produce a partially shirtless dude, but it was only by inherent associations of the model itself - I couldn't use any such keyword in the prompt, only by the nature of what barbarians in fur clothing and no armor look like, the model inserted the very little amount of nudity on its own. That was more than a year ago though, haven't been using Midjourney for a good while.
Well, I’ve generated thousands of fantasy images that are nudes, by teasing it out of the results instead of explicitly asking for it. You have to keep tweaking via inpaint mode about a hundred times and bouncing back and forth between 5.1 and 6 (because 5.2 will block the output even if you’re not even trying to get anything nude). But at some point (after about a year of doing this) I got a heavy-handed warning and the entire account froze up until I re-read their no-nudity FAQ and agreed to it. (I got busted!) So I read it and agreed and moved on with my other fantasy images. But in general I wish they did have a mode that allowed me to do what I want for my own purposes. I wasn’t even trying to get sex scenes, just natural scenes of fantastic human creatures, and it’s quite a fight sometimes. And since I give them $600+ a year, with privacy mode turned on, you’d think I’d have that luxury. But no.
I don't know if something changed, but I went to explore and rank images this week, and some of the pics were showing tons of skin, and there were even some topless women. I was surprised, because I thought they were avoiding nudity as part of their brand
MJ is not for artists. The very fact it is so heavily censored means it can never really be an artist's tool, no matter how good the images it produces.
it's good for amateurs 😅
@@Broockle not even for amateurs
@@omegaidol more like people that really only use Photoshop or GIMP to snip and move images around for collages 😅
I think comparing the lowest model of Flux with MidJourney is a bit biased... intentionally.
His comparison between MJ and Dall-E 3 is also very biased towards MJ. I stopped watching after the "Pixar"-prompt... th-cam.com/video/AXv5sgIoPnc/w-d-xo.html
Right, he’s very biased IMHO but he’s not claiming to be totally objective. He doesn’t try to hide his bias. 😂
Also, he is choosing one of four images for Midjourney and is comparing it against the first generated with Flux 🤨
Flux Schnell is substantially inferior to either Flux Dev or Flux Pro. The reason it exists is that it's fast (the word Schnell means "fast" in German): reasonably good images generated in only 4 steps. I'd be surprised if Schnell could beat Midjourney in many head to head comparisons. Flux Dev and Pro can easily match and often beat Midjourney.
It's surprising he tested the worst quality version
Das ist funkspeil.
Schnell, meinen üntermenchen, schnell!
@@karlpj1 Not surprised at all. It's meant to be run either through a cloud service like RunPod or locally, and if you were actually using it you would run 10 images, not 1-4. Plus the prompting is all wrong for this model: it uses descriptive prompts like DALL-E or Ideogram.
I agree - why did he test the worst version of FLUX LOL I use PRO and it is amazing!!
Midjourney has a 'look'. You know an image was generated with it as soon as you see it. Which in my opinion is not a good thing even if it is aesthetically "better".
Flux has a look in similar way. Flux is a great model that is easily trainable too.
I agree. That's exactly what I thought when I looked at the comparative images of the young woman at the beginning of the video. The MJ image was more dramatic, but also slightly less natural. If they asked ME which image was made by AI, I would've picked the MJ version.
You should do a Midjourney vs the Flux Pro video. Comparing MJ to the Flux Schnell is like comparing David and Goliath. Try Goliath vs Goliath.
.5cents per image is kind of steep, better off using Pro or a fine-tune on CivitAI
I know what you mean... hehhe.... just remember david kicked goliaths ass? And personally, even in this vid, flux managed to smash goliath for me at least.
@@KOSMIKFEADRECORDS So it is written... 😂. I was a born again christian for 31 years. The bible is not the Word of God and it is historically hogwash but it contains spiritual secrets that are mostly unlocked by spiritualists rather than the religious.
Yeah, I didn't think it was a fair test
@@truepilgrimm Sorry to hear that. But you are dead wrong. Pun intended. The Bible IS the Word of God whether you like it or not or agree with it or not.
King did have 5 fingers, 1 was going up the sword handle
A much thinner finger sticking out of the pointer finger tho
Yep. Came in here to mention this! Also, it's possible that the first "four finger" image has the so-called missing finger holding the left side of the camera. Also, the Flux King had a nice crown while the MJ one did not...
i was searching for this comment
@Didi_Meow_Records It's not thinner, it's just partial behind what the hand is holding.
All of the hand shots from Flux looked done right. There is a difference between not visible in the scene and not physically there. I was able to create similar looks with my own hand.
He is definitely biased to MJ.
In comparing Flux and Midjourney, Flux seems more grounded in reality for image generation. My experience in architecture makes the skyscraper comparison particularly revealing. The Midjourney building resembles a Christmas tree, lacking the structural integrity of a real building, especially with the floor levels. In contrast, Flux renders the floor levels and overall design more clearly.
Midjourney certainly has no structure when it comes to buildings, with some exceptions. When doing panos it is a bit better, I guess because it copies references that show the buildings more completely. In everything else it is structurally messy, but artistically inspiring.
I agree. It’s far more accurate to reality. Its training data is more reliable in my opinion. I thought the Flux couple shown side by side were people I’d like to meet, whereas the MJ couple would prejudge me, and their expressions were cold and aloof.
14:30 seriously, the one on the right looks far more stylish and suave, the one on the left just looks scruffy. Also hate the fringe on the left. The left are just scruffy hippies with blue hair
I was planning to say the same thing. When has it become that junkie/homeless looking people are stylish now? These days you’re more original and stylish if you’re just yourself, clean and neat.
Absolutely. I have no idea what he's on about the left couple being more stylish or more attractive
"Midjourney did a much better job of follow the prompt" .... proceeds to show a t-shirt with R2D2 on the left side and actual Yoda on the right...
I can't agree about the couples, though maybe it's because I'm a boomer. The couple on the left look vacant and unaware of their surroundings, and slavishly follow today's conventions--tattoos, of course, though thankfully no piercings or nose rings. The couple on the right seem not only to be conscious, but actually enjoying the situation. I can't say their look is timeless, but I think, or maybe just hope, that the fashion adopted by the left hand couple disappears quickly.
I came to say the same. I totally agree with you, James. I hope it's only sarcasm from him... Or he may need new glasses 😂
The couple on the left looks insufferable 😂
@@ElHongoVerde nah, it's just that "visual quality" and "style" are quite subjective, and he is probably used to MJ more, and values this style more which is why he chose MJ over SD in the first place.
This definitely. The couple on the left looks insufferable and horrible.
Yeah, I agree. The couple on the left also looked lgbtq and woke...
Jolly kings crown is spelled wrong. lol. And flux still got it. The flux image has 5 fingers, one goes up the sword hilt. Same with the 4 fingered camera shot.
13:26 It does appear that the Flux king has an index finger - but it is incredibly flexible to be in that position. IMO.
Midjourney is censored and they make your ai art results public for everyone to copy and use for themselves unless you pay extra $$$ for privacy.
well, technically it isn't your art. Midjourney generated it for you.
Furthermore, MJ has hardly any of the other tools SD has. Why people are so crazy about MJ I will never get.
@@retroelectrical Their terms of service say otherwise.
I'm from Russia and have never once seen my own work in the user gallery section :) 50K+ generations, so you're welcome to it if that bothers you :)))
@@cekuhnen because it gives people without a decent GPU setup a way to generate images?! Not everyone is as rich as you
Regarding the couple you gave to MJ, Flux did give you exactly what you wanted. And regarding the king with open arms, he did have 5 fingers. The hand shown with 4 fingers has a fifth finger going upwards against the back of the handle, you can actually see it... seriously gotta look at the details. Fair judgement comes at the price of not favoring one over the other because it has the funnier pic, but judging what it gives you against what you've written.
Flux has been great, installed yesterday. Low VRAM model FTW!
Does it work with 8gig vram ? The dev model ?
Could you please point us to an easy way to install it. Thank you.
Can you use its API remotely?
@@Iriesu Will likely take 2-4 minutes to generate. I run 2k prompts on a 4080 super (16 gb vram) and it's less than a minute.
@@silverstreetman I used this tutorial for my Comfy UI setup and Comfy UI Manager + Stability Matrix: th-cam.com/video/aPQ8gvTNCKM/w-d-xo.htmlsi=uWe2KThFMW5fgWHJ. So results may vary. There was one link in the description that was busted, but if you match the file names/file sizes you should be able to deduce what you will need. Hope that helps!
13:45 - Schnell *did* include all five fingers. The index finger of the right hand is hidden behind the scepter.
I'm doing a movie with AI and censorship is a huge problem. I'm doing a cross between The Matrix and 1984.
Can't wait to see it!
Right. I did the same thing with a Superhero video. Characters fighting has always been a problem
So what you're making is a thought provoking take in Matrix? Because I think if they did a really serious and thought and emotion take in Matrix, it would have been 10x the movie it is now. Dodging bullets and action is cool. But one that really makes you think about being in an emulation, and really digs deep down, and makes the viewer wonder, would have been a much better film.
yes lot of words are blocked,
If I pay for it, I use it as I want. My paid AI video generator (of another brand) does allow me to do that, and I have done several versions and it's sure fun. 😃👍
14:45 you mentioned that you preferred the image that MJ generated, but in the end, Flux followed the prompt faithfully, while MJ didn't. I don't think you noticed it, but you asked for a Yoda motif on the t-shirt, yet MJ generated an R2D2 image on the t-shirt.
On the first image, Flux gave you something truly cinematic, where MJ gave you more of a pretty photo. As far as I'm concerned, Flux followed the prompt much more closely as a result.
The people of MJ look often like from a disgusting advertising catalog.
Yes
14:30 Bro... You need glasses. Please tell me you confused left with right.
Was about to say the same, he kinda lost me there completely. Plus, this is absolutely subjective who is "more attractive", at least where I live, I would probably risk a beating going out like this.
x2
big glasses lol "the couple on the left is more attractive"
Right? The man on the left also has zero delineation between the undershirt and the pants, it's just one long seam like he's wearing some sort of funky onesie under the suit jacket. The one on the right didn't really get the spiky hair bit well, but it got all other prompts, and the woman on the left has the wrong iconography entirely. Sure MJ pretty consistently has more details that were not part of the prompt, but that's not necessarily a good thing if you're trying to control the output and it veers into an unwanted direction.
The stylization of the architecture was also a clearly biased viewpoint again. He knocks the Flux image for being at 'an odd height' when the other image is the exact same in that regard, just with an additional pan upward. Plus the MJ one is so very stereotypically overdone as cyberpunk, stretched beyond even its norm, with large swaths of the building plastered in screens where it isn't coherent, whereas the Flux one has cleaner stylized buildings above the walkable visual height. Honestly, they both lost this one for me. One was way overdone, the other could have used more detailing but at least looked like a proper building. But his bias for MJ clearly shows time and again.
Then of course, there was the 'slightly misinterpreted' astronaut picture of MJ's that completely failed at the prompt for the astronaut to be hatching from the egg, and how he only allowed Flux 2 shots in the hand comparison to MJ's 4. He might as well have just started the video with 'I like MJ better regardless of results' and then closed the video out.
Thanks for the video. What I don't understand is why you compare MJ 6.1 with FLUX 1.1 Schnell and not with the better FLUX 1.1 Pro model.
Would have been more interesting, I think, to compare Midjourney with the Pro version. Sure it would be more expensive, but probably still cheaper than MJ and I bet the results would have been far superior. Anyway, still an interesting video and appreciated your examples.
Fantastic stuff - I'm a regular user of Freepik as it's less censored than Adobe, and with my fashion model photography work I'm able to successfully use Freepik with NSFW projects for retouch, expand and creative upscale all in one tool, whereas Photoshop is painful and rather frustrating to try to get working with any shot that shows more than a bare elbow. Additionally, Freepik just last week sneaked in the Flux model as part of their options and it's a great addition.
You're comparing Midjourney's best model to Flux's smallest and least-capable model. The fact that Flux schnell generates anything close to Midjourney is a win for Flux. If you compared the Flux Dev model, it would be about equivalent and Flux Pro wins out.
Another point is that he is choosing the best of four results in MJ everytime and comparing it to the first Flux image which was generated 😂
With such SFW demands, they would need to shut down 90% of Hollywood's studios, and with them half the city as part of their outside infrastructure. These restrictions apply not only to Midjourney, but also to most online AI graphics services. For example, they refused to depict the Madonna with a baby: child, partially nude, religious theme. That's why I gave up on these AIs and bought an RTX 2060 (for a start). 8 GB VRAM and 32 GB RAM on an Intel Core i5 7600 is enough to run Flux.1 dev. It runs an order of magnitude slower than the regular models, but the quality is significantly higher and there are no restrictions on content. (Translated from Russian in DeepL)
15:42 -- i like the contrast and stark lines on the Flux city scene... MJ often adds TOO MUCH detail these days that clutters the image, unlike older MJ versions that are less "busy"... when i am compositing images together i don't want a bunch of extra "fake" detail that is just clutter... if that makes sense?
You can get around that a little with settings or prompts, but I know what you're saying, like it's trying too hard. But actually that is what I like MidJourney for: it's more artsy, more abstract and surreal, I feel like I'm creating art that doesn't exist. Now Flux better not go that way, because it's like earthbound, dead realistic and amazing. So I use both. I hope MidJourney doesn't try to be Flux. I mean, fix the body issues, but they better not lose the wild unrealistic artsy style.
@19:57 I'm using flux-dev and flux schnell on my computer in ComfyUI with a 4060 Ti 16GB.
On dev with a simple scheduler with 20 steps for 1024x1024 it takes about 50 seconds per image.
On schnell with a simple scheduler with 4 steps on a 1024x1024 it takes about 9 seconds per image.
I'm new to AI image generation locally but I'm very excited about what can be done on a ($450 retail) / ($250 used) GPU.
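For anyone who wants to compare those two timings per step rather than per image, here is a minimal sketch in Python. It only uses the numbers quoted in this comment (20 steps in about 50 seconds for dev, 4 steps in about 9 seconds for schnell). The names `timings` and `seconds_per_step` are made up for illustration, and it assumes the whole time is spent on sampling steps, which ignores fixed costs such as model loading, text encoding and image decoding.

```python
# Timings as quoted in the comment above (4060 Ti 16GB, 1024x1024, ComfyUI).
# Assumption: total time is spread evenly over the sampling steps.
timings = {
    "dev": {"steps": 20, "seconds": 50},
    "schnell": {"steps": 4, "seconds": 9},
}


def seconds_per_step(steps, seconds):
    """Average wall-clock seconds spent per sampling step."""
    return seconds / steps


per_step = {name: seconds_per_step(**t) for name, t in timings.items()}

# How many times faster schnell produces a finished image than dev.
speedup = timings["dev"]["seconds"] / timings["schnell"]["seconds"]

for name, value in per_step.items():
    print(f"{name}: {value:.2f} s/step")
print(f"schnell finishes an image about {speedup:.1f}x faster")
```

Read this way, the per-step cost comes out close for both models on this card, so most of Schnell's speed advantage would come from needing far fewer steps.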
Same, i'm using Flux Schnell on 1650 4Gb 4 steps, takes 3 mins per image 640x360 but upscaling it to 1280x720, takes longer, but quality is still amazing.
Seeing similar generation times on a 64GB M1 Ultra processor. My fingers are crossed, hoping we'll see some optimizations soon!
how u run it on your pc?
I think they use the censorship as an excuse so they don't have to "work" all the data. IMO. They could easily make a section for adults only. All of them could. There are "R" rated movies and scenes in the real world. Are they serious about creating art or not?
I hope so. Just take off the guardrails for those that pay and sign off on a section saying we understand. Also, we should be able to flag something we feel is over the edge. I mean, we don't want any pictures of violence against real living people, or nudity of real people. They can keep logs and kick off offenders after a warning, and we too can flag stuff we make that goes off the rails. I had a few results that were like, ummm, I didn't ask for that, so the AI can, if the guardrails are off, do some really bad things. We should be able to flag and delete, and they should have ways to monitor. But you know, it's not easy, so I kinda understand. I guess it would work if everyone was good about it, but just look at Discord and what Dalle3 did initially: a few people ruined it for us. People tend to go too far for shock value, probably kids, which is understandable too.
But yeah they make movies like SAW that are crazy sick, I can't even watch, but I don't mind, I just don't watch. But like we make an image with a person tied down at gunpoint for a crime comic cover or crime book and that's bad? I try to do comic covers like the ones from the 50's pre code horror and get BLOCKED. Open source will fix that, but I want Dalle3 and MidJourney to add R sections.
Before, I could at least understand it when it was using Discord, but now that they have a version that runs off its own site, it's increasingly strange to censor things to this extent. I guess the only reason they censor anything is to avoid getting into hot water regarding political figures, CP, and whatnot. But the idea that they cover women to the extent that they do is still baffling. I'm not even prompting for nudes, but it still refuses to show legs and shoulders a lot of the time.
@@gmcubed You could be forgiven for thinking that Midjourney was programmed by the Taliban
Nothing is more frustrating than creating a detailed prompt, only to have Midjourney tell me that SOMETHING is banned, without telling me what it is. Then going through every word trying to figure out what could potentially be objectionable to MJ, even though it all seems fairly innocent.
I prefer " variety " to the more contemporary and often politicized term "diversity." To me, "variety" captures the essence of differences and distinctions more neutrally and inclusively. It emphasizes a broad spectrum of elements and experiences without the potential ideological connotations that "diversity" might carry in current discourse. By focusing on "variety," we can appreciate the richness of different perspectives and contributions without the baggage that sometimes accompanies the term "diversity." This choice of words helps foster a more open and straightforward dialogue about differences and their value to our interactions and environments.
"Diverse also somewhat suggests / connotates "incompatible" or at least I think it has some of that baggage in common usage. "Variety" is not a bad synonym...
@@kaizen5023 Variety is inclusive and, importantly, not "WOKE".
This is a brilliant observation!
@@excelerator well thank you very much for this wonderful compliment. It really makes a person's day when somebody else acknowledges another person's observation.
Regardless of quality. I don’t do image generators that censor
Thanks for a very subjective review of both. I do however find the Flux characters to be more photo-realistic. The first one of the girl, as soon as the MJ one came up, even though the lighting is superb and you could even see the tiny hairs under her chin, something about the skin just screamed "computer generated" to me (maybe because it was too perfect and didn't have any imperfections in it?). She looked like a well modeled game character, whereas with the Flux one, whilst being 'Scandinavian noir' as you put it, the skin tone and texture looked like she was real. The same with the couples: again the MJ ones looked like well modeled game characters, and even though the rest might have looked funny, the Flux characters' skin tones and features looked more realistic to me.
No that's not how I see it with those couples in the images, The MJ version looks like two left wing hippies, and the couple on the right look like conservatives
Fascinating - you think the couple on the left looks "much more attractive"? They look gross to me lol, girl caked in makeup, both covered in tattoos, guy's hair looks dirty and messy, and not a fan of the guy's outfit at all. I'd give one point to the left girl's shirt over the one on the right but that's about it from my perspective. From a technical standpoint though, the pic on the left is more photo-realistic in my opinion.
I thought he switched left and right (I know I do at times, but not this time. I think :) ).
Wait.. he said the tattoo girl is attractive? My mind thought he said the other way round... Lolzz
I'd take away your point for the shirt, because the prompt asked for a "Yoda-themed" shirt, and the girl on the left has an R2-D2 shirt instead. :)
@@BrooksMoses Fair point!
14:50 left couple is woke
In my test, Flux seems to be really good at creating images that are easy to turn into video using Kling.
Flux seemed more realistic, while MJ seemed more PS5 gaming images. Both finger pics did have 5 fingers. I think Flux Pro would be incredible.
You've taken the most basic option available in FLUX and pitted it against the best of what Midjourney offers! Not only that, some of your conclusions are fundamentally flawed, especially with regards to numbers of fingers. It appears you count fingers as well as Midjourney does!
People pay for AI generators they should be allowed to make whatever they want without censorship, a lot of AI generators are so prudish.
I train my own models on my own art and photography with flux and the sheer POWER of piloting a 16 channel pipeline like this is.. difficult to articulate. It's also free. Of course, my hardware wasn't, but all things considered; I don't think an RTX 4090 or the odd lease of an H100 cluster for larger trainings is all that much when you consider the sheer gravity of what you're getting in return.
This stuff really is science-fiction-good. It's blowing my mind daily.
Thank you Samson, I just gave Flux Schnell a go for the things I use it for (elements for book publishing), and it's the ONLY text-to-image AI tool that rivals Midjourney for me. Really impressed with some of the outputs. I like Midjourney for some things but this will become a regular too.
I love watching your shows your humor while staying consistently serious is timeless and hilarious
13:42 The Schnell image has actually got a full complement of fingers. If one looks closely, the index finger is positioned flush with the stick.
How do you install flux to process on your own computer? I like the pictures from flux better.
The couple on the left looks like they migrated from a village in a developing country and are showcasing themselves as "cool" and "stylish" while looking like people who were raised on TikTok.
On the right: more or less normally dressed people with style (except for the dyed hair).
In both cases where you say FLUX only shows 4 fingers, I can easily see that there are 5 fingers, but one of them is not clearly visible because of the position of the hand. At t=50, the little finger appears to be wrapped around the camera the hand is holding. And at t=13:11, the right-hand index finger is clearly visible behind the staff. (You're paying $30/month for Midjourney and nothing for FLUX Schnell.)
Yes I see the same thing with some fingers obscured due to hand position.
Exactly. How does he think the hand is holding the camera?
You are the only one in the comments to have noticed this. Bravo.
As good as it is, there's no way I'm using something that works through Discord. The end. 🤐
You know that you can use the MJ website after 100 generated images, right? I haven't used Discord for a long time.
@@darkregentt you mean you can start using it on a website instead of Discord after you create 100 images?
@@darkregentt why won't they let users use their web app from the start?
@@nexusknight7 Don't know for sure, but I think the reason is that they still want people to join the community through Discord. If somebody has a problem with anything, they ask through Discord, and they still want to expand the community there for whatever reasons (idk), so the 100-image condition is just because of that. In the MJ office hours they said that soon you won't need Discord at all and they'll open it up to more people, so that's it.
I could seriously follow training content developed by you; your way of talking does not bore me.
No point arguing about matters of taste, but I like the couple on the right better around 15:00. Couple on left looks shabby and trying too hard.
I like what Flux is doing, I think that the first prompt image of the girl in Flux looked more realistic than the girl in MJ.
Midjourney appears to be generating a more "atmospheric" picture, but it is compared against the weakest Flux, so I would expect some disparity. With the couple picture, the pair on the left were more similar to each other in style, but the woman on the right was more natural, even if the man's clothing looked a bit "painted" and the two weren't so well paired in style.
I'm running dev on my desktop, works great! And oh, it's hard to forget it's FREE! Besides the cost of the watts anyway, but way less money and less restricted than other solutions.
You totally forgot to mention Stable Diffusion with its LoRAs. You can generate custom content that is 100% impossible with all the other tools (right now). Imagine your kids draw their own character in their own style. You can make a LoRA of it and let it go on an adventure =)
The problem with Midjourney is that it is too censored: you cannot create an erotic scene, even if it isn't really erotic, and you cannot create a war scene with blood. And finally, the price is quite high for something with so many restrictions. They need to do something about it before they lose the battle against other systems like Flux, Fooocus, etc.
For me it feels like MJ just has its own distinguished style while flux needs a little bit more guidance.
And the comparison feels very biased because you know how to prompt for MJ to get the results you like and apply these prompts to Flux.
So in the end you just find out how well your MJ prompts work in Flux and are not really comparing the two models..
But I understand the lifecycle of YT doesn't allow for waiting until we figured out how Flux prompting works ^^
Your comment on the couple does not seem right. Btw, I am 52. Maybe I am old.
I'm going to have to give the win to Flux here. My prompt "Space Sherrif T-Rex holding a Winchester rifle" gave me a dinosaur holding a rifle (granted, it was a shotgun), but that's still better than anything SFW/violence-censored, which at best gave me a dinosaur holding a parasol.
The platform that will master character consistency and very flexible policy content will get it all. Period
Your "subjective viewpoints" on the matter are shared by yours truly.
Great video!
I appreciate that!
16:46 This is always a flaw in people's criticisms and/or reviews of AI. As you have stated, this is a German company, and if you look at the man and woman it created, they are very German in their look. These are biases that are put into every AI, whether it is an image generator or a chatbot. There is always a bias left behind by the programmers and those who engineer chatbots.
thank you, been spending money and working these things out for myself so seeing what you think is cool.
Well... for the "censorship" part I found out that Flux (using it locally with ComfyUI) is OK with boobs and butts but doesn't want to draw the rest... so it isn't really uncensored; not totally, at least.
13:46
Midjourney (V6.1) adds a woke bias to the prompt and got the following wrong:
- age - adolescents instead of adults,
- suit colour - gave lilac pink instead of purple,
- t-shirt graphic - gave R2D2 instead of Yoda.
Flux (Schnell) follows the prompt more accurately.
Midjourney images always seem to have a sameness to them, oversaturated, like all Thomas Kinkade paintings...
14:50 ... the couple on the left is more attractive? They look like they shop at Goodwill! 🤣🤣🤣
I just had to laugh at how "schnell - fast" you pronounced it in German :D so cute. I'm curious to see what Flux is like, I'll give it a try.
In the 1st comparison, the Midjourney model looks like it's from a game; it isn't photorealistic. It is artistic for sure, but it has nothing in common with a photo taken with a digital camera on the street.
Canva bought Leonardo and Affinity and what a dream would it be if Affinity could use Leonardo inside Affinity Photo or Designer !
Leonardo is basic Stable Diffusion with a fancy interface. I don't think it does a single thing better than everyone else.
@@karlpj1 I am talking about Affinity Photo etc getting AI inside their apps.
@@karlpj1 I said: imagine if Canva, who now owns Leonardo and Affinity, offered AI inside Affinity apps like Adobe does.
I think you did not understand what I wrote.
Sites where you have to Log In, give your email or even your phone to get to play around isn't "Free". Make mention of that in your next videos. "You have to log in or make an account" etc. Giving your info is still a price to pay.
CENSORSHIP is a serious crime in art. Felony.
Most artists are actually taking data from movies, shows and other sources of art. We/they just don't realize it because we don't store that data the way AI would, making it easily retrievable. I know of well-known artists in all fields who are similar to other artists, but with their own unique style, and when you ask them what inspired them, they usually reveal the source. From there, you can find the similarities. Take the movie Pulp Fiction. Tarantino literally tells us his ideas came from the writer Elmore Leonard. And he's right. But should Elmore sue everybody who does similar things to his work? Who inspired him?
how is this shitty argument not dead yet :D
even copying a style 1:1 takes years of dedication, skill and effort, while also being a conscious choice. A person can choose a picture as reference, but they can also IGNORE a picture. AI does not choose what it trains on, nor can it choose to ignore certain pieces.
also, as any experienced artist will tell you, the biggest inspiration of an artist is the real world.^^
btw, we store data, AI doesn't. We can recall an image we have seen and, if we are good enough, recreate it 1:1. AI can't, because it does not store images, it only breaks down every image into a vector with much lower dimensionality, trying to extract the most important features, then updating its model weights accordingly.
@@thomasmann4536 We are a more "advanced" version. But you can train simple AI models, and draw a basic chicken. Then train the AI, like the artists have trained, to make it a different style.
I'm an artist. I can draw. Just like AI, I used a graph to get the 3D right. I even use my hand to make similar shapes of what I'm drawing as a reference. AI is just a "tool". It can't innovate like you or I can "yet". But if you are an artist, use that tool. It can lessen the workload. Maybe learn how to train your own model just for you. It's pretty cool stuff. You can layer your models and advance your styles much faster.
@@insurancecasino5790 my friend, what you say isn't wrong, though also not fully right. Btw, I'm a 3D animator but my academic background is in AI. I've studied NLP under Björn Ommer, one of the creators of Stable Diffusion, and the tech is always fascinating to me. I actually made my own models and tinkered around with the tech. The biggest difference between using conventional tools and AI is this: you can get really good at Blender or Photoshop and then create EXACTLY what you want CONSISTENTLY. But no matter how good you get at using AI, you will never be able to generate exactly what you want consistently. It's just the inherent probabilistic nature of these models. Ya, you can use them as a base, or inspiration, which I often do. But in the end, if you were to rely on AI entirely, you're just trading a skill limitation for a model limitation, which is one you can't overcome yourself.^^
@@thomasmann4536 Depends on what type of art. If your brush is random to see what might inspire you, AI is "perfect" for that inspiration. I generated a chicken with sunglasses. Welp, it put these giant sunglasses in front of a tiny chicken on a rock in a country-style landscape. So I asked myself, who do those glasses belong to? A giant? Then it gave me the surreal idea of a giant cooking his chickens with what look like two magnifying glasses made into magnifying sunglasses. You don't realize what's going on until you look at the chicken and notice it's actually cooking. I would have never come up with that without AI. And there's the surreal connection to "sun" glasses.
@@insurancecasino5790 thats an unfalsifiable statement. nobody can prove you wrong on this because it already happened and now you already have this idea in mind. You might have not gotten this idea had you not used AI. But you would have probably gotten a different one. Maybe a better one?
I've noticed that a lot of models will generate generic persons (without specifying race, age, sex, etc.) in a characteristic way because they've been trained on specific sets of provided images, possibly involving the hiring of models from the local region. Certain Stable Diffusion models (e.g., Deliberate V2) seem to favor Asian looking people, while Flux tends to generate people who look German. But they can be a little more creative and flexible with the right prompting. I definitely look forward to trying out LoRAs with Flux to see how well they help to steer Human characters towards a desired look.
So flux is the uncensored AI? Have you tried it? I have tried free implementations and NSFW content seems to not fly.
In fact, in the architectural prompt, I really prefer Flux's result, because MJ's might look amazing, but it is a bluff in terms of realism and actual architectural sense. If you tried to 3D model MJ's result, you would suffer trying to make real, sensible shapes that replicate it. Whereas in the Flux result, the shapes are clear and defined, and it makes sense how the building is constructed. I would much rather take inspiration from Flux than from MJ to achieve something "Real".
With the advancements of software and hardware, self-hosted AI servers will become the norm in the near future. This would be an amazing time for creative people =)
I guess it's all subjective. The couple you preferred are what I'd be happy with from a "garbage humans" prompt...oh well...
I used all 3 Flux, and I can't believe you made a whole video with its lowest level to go up against MJ. I must admit tho you got some incredible shots from Schnell. Flux Pro should be your next video. Even Flux dev is much better than Schnell too
Flux is really amazing, being able to run it locally with 4Gb Vram and 32 Gb system ram it only takes about 3 minutes to produce very beautiful images. The NSFW does need a bit more development, but that's not what i'm interested in, i'm just happy i can run it locally and play around with it, oh yeah, and flux is free.
Rating the images you created:
1. Flux looked more realistic than MJ.
2. 4 tries vs 2 tries might not be very fair, but hands are difficult.
3. You saw the result, but flux can mess it up trust me.
4. The king from Flux does look more realistic and both hands do have 5 fingers, MJ didn't show hands so hard to fault flux there.
5. The flux couple were prettier imo, but the background for MJ was better.
6. MJ had the better image, but again Flux Schnell is their lowest quality model and still free vs paid MJ
7. Both did well; Flux interpreted the prompt better but looks a bit 3D-model-like.
I would say that Midjourney is great to create graphics or characters for video games, a little bit like Unreal Engine. we cannot tell that the quality of the skin is photorealistic because it seems always a little bit synthetic. If you take a picture with a camera you will see the difference I think, very easily. But the artistic result is very good for sure with great DOF, color range, etc.. the example of the happy king, we can see that it isn’t realistic. If you compare the skin of the Midjourney King with a photo of Theoden from the Lord of the Rings movie, you can see the difference easily. For an animation movie or a game introduction, it is perfect, very artistic but I’m not sure that it would be good enough for a real movie like the lord of the Rings, or maybe for very fast and short scene where we cannot see the difference.
Extremely good at following specs and artistically spartan. Yep, it's German.
as an architect the Flux skyscraper makes more sense on the realism aspect, also the scale and proportions are more believable, the MJ is not bad but is more focused on digital art examples that I do like, but are not as detailed and as believable, also the king prompt in flux had 5 fingers, but one was behind the sword.
Flux has good image quality, and it can do text and signage. However, Flux is also censored, and its generated human models still have six fingers and six toes per appendage. So not yet impressed. And I used the Pro. Let's see what happens in Pro.2
Midjourney was an extremely traumatic experience for someone so excited about AI art it was unreal. You can’t make anything even remotely creative with it. I mean truly creative. The censorship just plain cannot work. Your first clue is that god allows free creativity THEN selects in evolutionary processes. Your second clue is that every word processor, the printing press, etc followed the same pattern without question because true creativity by definition cannot be controlled or anticipated. You cannot remove words in a language because they might be misused. Its so stunningly 3rd Reich “degenerate art” law ridiculous that I was speechless 😶 Pun definitely intended 😂😂😂👁️
I was just comparing Flux Schnell and Flux Dev on my local machine, and my 12 GB RTX 3060 and 32 GB RAM handled both fairly well in ComfyUI. Schnell took about 40 seconds at 4 steps and Dev took about 120-140 seconds at 20 steps. The Dev images were much more detailed and realistic, but took longer. That's the trade-off, I guess.
and the big downside of not being able to sell them? I don't understand why commercial use is not allowed for dev......
@@gatwick127 You can sell your outputs. You just can't set up a service using the models or derivative models and charge for it.
Here's the part of the license that talks about outputs:
Outputs. We claim no ownership rights in and to the Outputs. You are solely responsible for the Outputs you generate and their subsequent uses in accordance with this License. You may use Output for any purpose (including for commercial purposes), except as expressly prohibited herein. You may not use the Output to train, fine-tune or distill a model that is competitive with the FLUX.1 [dev] Model.
Flux 1000%. It's open source. Anything open source gets an extra 20 pts to begin with. I feel there was shade towards Flux on the first test and you rated it based on your preference rather than on what it was told to do. Keep in mind you're also using the lowest-end model of Flux. If you were gonna have it hosted on a server, you shoulda gone toe to toe with the Pro to be fair, but I'm glad you showed off the consumer grade cause now I'm going to install it. Thanks :D
Runway also has major restrictions, no one can generate actual engaged battle scenes etc
Yup, it's very strict, I might jump ship unless they loosen up. You can't even do a normal woman in a bikini on the beach, or horror. I get some false positives too. But it's still crazy good.
@@TheFeedRocket well I got censored for a drilling machine 🤓
I'm hoping AI can improve the restoration and colourisation of old movies, to give them a new lease of life.
The power of open source is that it constantly leaps forward because thousands of developers work on it, while closed source is limited to a handful of developers. FLUX is very promising, open source, uncensored, only a few weeks old and it's already taking the AI world by storm. I'm sure it will surpass all the other AI platforms, sooner or later. Oh, and open source is FREE.
Hey! Great video. Just a heads up, I noticed a few mouth noises that were a bit distracting. There are tools that can help clean up mouth noises and make your audio more polished (like iZotope mouth de-click, which it's very easy to use but I'm sure there are other ones).
Good work again, towards the end what is the name of the software that helps with character consistency? Render something. Thanks much
Prompt adherence -> Flux, image quality -> MJ
I think this is simply incorrect at a foundational level. Obviously I'm not disputing your valid opinions on things; however, objectively, you asked Flux for something, and the question should be whether it provided what you asked for.
I have used Midjourney extensively in the past and now run Flux Dev locally.
The reasons I disagree at the top level are:
First, you compare the 'worst' Flux model against the best Midjourney; it's not a like-for-like comparison.
Then you openly state that Midjourney produces 4 images and you pick one, while in most of these tests you give Flux 1 chance.
You also state that both images meet the prompt but you prefer Midjourney's bright art style. Was that in the prompt? You state Flux will follow the prompt better, so you could say you aren't asking Flux for an art style you prefer, so it'll never win?
On Flux Dev, I have many pictures of hands, not just human hands, and it fails perhaps 5% of the time (and that's being pessimistic) in not giving what I expect in terms of biology for said creature. I'd imagine this is even better on Pro.
I'm obviously not intending to dismiss your opinion, MJ is beautiful too. I just disagree with the testing methodology you have come up with between these.
It seems to me that you personally prefer MJ's output art style and that affects your objectivity on this.
yes it is free but NO, it is NOT uncensored
Amazing video, you got a new sub. I personally prefer Flux's color palette on many of the images, but MJ did really well, especially on the cyberpunk-style city image; loved the MJ details on the buildings. But that is just me. lol
Would like to see a head to head using actual photos as prompt between MJ and Flux.1 or a comparison of MJ and Flux.1 blending images together.
Flux isn't censored ? Haha ..
I still use SD. Uncensored and free. I ain't gonna let these guys take a single dime from me.
dont you need to keep buying credits for all the image generation?
@@rexaustin2885 SD is always free. It can run locally on your machine or from a server if you know where to look. Abuse it anyway you like.
MJ doesn't know the difference between R2-D2 and Yoda...