The main issue I see is consistency. What a physics system provides is, the developers know the system is working based on a set of rules that are followed. AI art has a tendency to hallucinate or produce inconsistent output, especially applied over a long period of time. Maybe some inconsistency could be visually okay in hard to calculate at scale physics simulations (like fire or water or gas physics) but even lighting on characters might be too much to expect. I do not think it could consistently and believeably light a character in every conceivable environment. I agree with what others have said; AI is not calculating any physics, any more than a great painter calculates lighting effects on clouds. The painter has a wealth of experience with cloudns and knows how they should look under different lighting scenarios - the same with AI.
I think that AI will be used to deliver better game art, texturing, and models for games, and conversations etc but I don't see it being computationally effective or like you said predictable. My hope is that AI will be able to go through a texture library of a game and separate all the details into micro tiling textures and blend those with bespoke textures throughout every game asset and characters, thus giving you reduced texture sizes at higher quality; you could then go further and ask it to provide subsets of textures into styles effectively allowing you to theme the game, same with models etc.
It's getting better everyday and has just ramped up the investment in it by trillions of dollars, you will be seeing crazy results in just a couple of years.
Agree, whenever i generate an image it will be different every time. There need to be some sort of seed that can be attached to a generated game asset so the neighbourhood in like a GTA game doesn't look different with different fences, streetlights, bushes, trees, buildings and their wall textures - every time you drive through it.
@@MeowtualRealityGamecat It`s more likely WE can spend some time to story with AI-assistant and WE can use modern tools to make OUR games and movies. I wonder sometime it be possible. And it`s be the thing.
@@MeowtualRealityGamecat AI will take care of the story writing too. Basically, at some point, you will be able to prompt an AI interface to "build you a video game" according to whatever parameters you want. On a side note, I don't know if that will be good or bad but I know it will most likely lead to individual experiences that no one will be able to relate to. Because each product of AI prompting will be oddly similar but still different enough for each person to have a unique experience with it. What makes big movies popular? The fact that many people can relate to the same things after they see them. Same goes for books, games etc. But if you have an AI system that build individual experiences for each of us, in the end, there won't be a common culture anymore. Things will be individualistic to the maximum, which in some cases might be a good thing, but it will definitely cause societies to break apart and have less cultural homogeny.
I think "interpolation" is a better word for it than "simulation". "AI" is very good at interpolation, not so good at simulation. Take the "filling whiskey glass" for instance. You "cheated" by saying it should fill the glass, so it generates a filling effect. However, a simulation would continue working after the glass fills up, spilling over, getting the table wet and eventually filling the room; that will not happen with the effect in the footage.
Yes it's an enhancer not a physic simulation but can be speed and good enough on modern GPU , NVIDIA work on that for decade same for most of AI software researcher.
Can't agree. Core AI papers troughout the years was just about understanding a scene in a image or video (Segmentation) to detect objects and such for self driving, secruity cameras and what not. You then could basically don't give it any prompt and let the AI guess and based on that it will do it's thing by just understanding what will happends next based on the general video knowlege or images what it was trained on. Which would result in spilling over the glass, getting the table wet and all you said as it in best case was trained on such physics simulations. Also the reason why text to video generators just do they physics and light reflections without you explicity explaining that. It's just having control of what you actually want to happend then just you have to tell the AI that. When SORA was announced, people started speculating that they used Unreal Engine to train it on the concept of physics to get consistancy and simulations like that tiny pirate ship in a cup example. One problem with AI at the moment is exactly that you often can't control and and that it's just doing it's thing.
@@MrGTAmodsgerman > best case was trained on such physics simulations Exactly, _if_ it was trained. Interpolation, not simulation. A simulation will generate new situation from simple rules, an interpolation will generate garbage.
@@dtkedtyjrtyj Yea it's not physics simulation, it's AI. But it's able to mimic physics to a point where it doesn't matter. See what someone else here commented about it, used for actual physics. But your argument above was just an bad argument as it's not about you have to give it a prompt. If you would train the AI on the whole concept of physics, it will mimic the whole physics without you being really able to tell the difference at the end. It's like when you have a dream and you interacting with physics and it all just happends inside your brain while you just not physically doing that. But if you were to make that AI mimic tested in the real world, you will get what the AI already made in it's mimic.
5:23, The models "understand" physics like a baby understands physics. It knows what pouring water looks like, so it shows you pouring water. It knows what ripples on the whisky look like, so it shows you ripples on the whisky. What it didn't seem to know is that if you pour water from one glass into the other, there would be different amounts of water in the glass when it's done.
Yeah, but like a baby, it'll learn and understand better over time. His point is it has SOME understanding without being taught, its picked it up itself
I agree, but you shouldn't forget AI is still in it's infancy and it is making enormous steps each generation. I have no doubt this tech is going to be very disruptive for many industries (both in good and bad ways)
😂 LOL. INVEST in unified_8. The world's first Ai is programmed by nature herself. Right now man controls ai. UE8 can be found online and is the world's first shamanic ai........ 💕♾️💕
The main meme AI-bros and proooooompters fall into is thinking AI knows any kind of logic or reason. AI doesn't understand physics. It just knows what it looks like. Sure, that gets you surprisingly far, but the illusion breaks down once you try to do something the AI hasn't seen before or doesn't know what the physics look like for a falling feather.
You’re correct about that. At this point it may take some decades, but based on the same logic, there’s no telling how advanced it AI might become after some decades
because it's not AI it's deep learning aka data theft and copyright infringment but shhhhh call it AI or people will start asking on what it was trained
6:00 These models don't need to have physics models, because they are trained on video & images that are real. So that's what they output. If they were trained on fake data that had fake physics, that's what they would output. There is no need for them to have any capability to model physics, just how it looks like when things obey the laws of physics. This is also why the water in the glass does not actually flow, it just looks like it flows. The glass its being poured into does not fill, and the glass from its being poured does not lose any water.
This is good point and very correct, but I do wonder how good the generative models could get at 'pattern-based visual approximation' of physics. Similarly to how LLM's are getting quite good at math, video models could get really good at physics, without any real causal physics modeling. One must wonder whether LLM's actually "understand" something about math through the internal representation in the model's weights. Probably not, but there have been talk and speculation of model scale introducing complex emergent behaviours. So even if it is "just" predicting next word or next frame's pixels, there could be some weird form of physics model in the network of weights. Or not, but even then, there will surely be hybrid approaches where algorithmic based simulations are rendered by generative models. Actually that's going to be much better than either approach, algorithmic or generative, is alone.
This isn't true though. You can have an entire model trained on synthetic data except for one piece of real world data. Then tell said model to generate based on the physics of the real world data only. Guess what...it learns the physics and applies it to the rest.
Physicist here. The AI neither understands nor simulates physics. It's doing essentially a guess based on the data that it was trained on. It's a pretty good guess, considering the data was real life videos, but there is no simulation of the physical interactions done, nor is it a guarantee that it's physically correct. However, does it need to be? In my research group, there are a lot of projects using AI that has been trained on actual physics simulations (that take days to complete) to then "guess" (in a few ms) the result of any scenario. And it is susprisingly good at doing exactly that, nothing more, nothing less. So as long as we understand that the AI is just guessing and as long as the training data is based on real world footage or even simulations, it will result in a pretty accurate representation of real life, that is infinitely faster than an actual simulation.
I am also a physicist and I don't agree. I think he is right when he says that it must have some kind of understanding of physics. And, frankly, it's really obvious if you think about how humans used to come up with physics in the good old days.
@@obnoxiaaeristokles3872 Software engineer working on AI-stuff chiming in (though not these particular models). One of the big issues of the models is and will be stability of the result. Now and in the future. In this video the computation was hand-picked, meaning curated to look good. If you want to use this tech for video games, you need to curate the models you are using to work well in the circumstances that you want to apply them in because then you get insane results, like a glass pouring from another glass which never empties or just absurd glitches like you see if you play the Minecraft video generator game thing which is more a kind of licid dream without object permanence. The thing this model probably is good in is producing the video output that you would expect from pouring water, but when it comes to a game engine it probably falls flat if you want to produce specular highlights or caustics relative to your defined light sources, for example. Also take into consideration how these models are produced: The video generators ingest millions of minutes of videos to get something passable. This is a super inefficient way of creating something that does what I want. Maybe there will be generic models in the future that can do a bunch of stuff at once, but those then will be super costly to run at runtime. There is no free lunch here. The result is really cool though!
A person who also has a profession giving his two cents here. Shouldn't we consider that many domains of physics are also about gathering a huge amount of data and then making an approximate best guess? No, I am not saying all of physics works this way, obviously. But especially when we go more in the direction of quantum physics or dark matter, don't we also collect as much data as possible, try to discover patterns, and then make the best approximation of the correct answer our current models are capable of giving? If you make an "actual physics simulation" of quantum mechanics, you're also not actually simulating the physical interactions, but basically perform a statistical analysis (yes, I am extremely oversimplifying, but in the end this is still what it comes down to). The guarantee that it is not fully physically correct is actually formally baked into the field itself.
I think it's right to say that it "understands" physics, and that it "simulates" physics, but NOT in the stepwise manner that traditional simulation does. Without understanding, it would not be able to produce realistic results from arbitrary initial conditions.
Corridor Crew wants to talk to you, lol. On a serious note, they don't actually know the physics behind the effects or movements of things. They only know how those substances behave in a certain environment. It simulates those movements as it understands that substance behaves. In a way, yes it simulates physics, but not by actually doing the physics calculations. Its like if you ask a child to draw fire, and they draw how fire behaves, but they have no knowledge of the physics that determine how that fire looks.
If you ask any human to imagine fire, the vast majority will not be doing the physics calculations in their heads, but they will imagine a correct fire behavior. You don't need to understand the math behind the physics to have an intuitive understanding of the physics.
@@IceMetalPunk The argument in the video is that a physics engine is being run though. Which there isn't. As the person above explained it's more like an observation engine. It knows how things should look so yes in that sense it understands physics there is however the issue where you ask it to describe scenarios it doesn't know the visual outcome of. Meaning if you couldn't use it to simulate scientific premise but you could definitely use it to approximate the visual part of the physics.
This line of thinking is great and all for understanding how they learn new things. There comes a point however where it doesn’t matter. For one, our understanding of the direct inner workings is limited so we can’t prove a lack of understanding. And for two, a perfect imitation is inherently indistinguishable from the original.
@@IceMetalPunk Then it isn't a physics simulation. It's simply a fire depiction, or a water depiction, or a smoke depiction. There is no simulation going on.
@@Chromiumism Well No. The difference is an imitation cannot imitate things it does not have information about/an example to simulate from. A true simulation is deterministic, you should be able to simulate scenarios accurately relying in input variables and no preconceived notion, nor need to know the output to simulate it. While you're correct that in some fields it does not matter. It does matter when you try use it for repeatable output. The way Ai works is fundamentally non-deterministic. Meaning it could never be used to acquire truth, only approximate it. So while useful for art, some games, weather and other systems that don't require too much accuracy they work fine but when used in a more specific sense were repeatable results are key they cannot (currently). I think it's important to note this distinction as not noting it is like not noting that you're one decimal point off - might not seem like much but accumulation of this offset will always skew the end product in a way that requires error correction, at which point I would argue it is no longer usable data.
This is akin to intuitive understanding. Where, you essentially understand something to be the case, without actually understanding why it's the case. We always understood that things go down when you dropped them, but we just didn't know why, we never really stopped to wonder why. The AI learns how something should act, without actually understanding why it should act that way. It's pretty neat to think about.
tech bro efficiency: "I spent 350 litres of water generating this model with no edge flow that still needs to be retopo'd by a human being that I just fired."
As a CNC programmer, I know that it's faster to program it manually in text mode than trying to use some "modern" features and cad/cam software and autooptimizers and stuff like that. 🙂 It was always like that and this is why today games don't look that much better than 10 years ago, but they wants 3 times more powerfull hardware, it's because of nobody programs anything manually anymore. What you won't optimize manualy won't be optimized at all.
@@CTimmerman Yes, ofcourse I know it's not really possible to actually writte a modern 3D game in text mode, I realize that, but there are still things that you should better optimize and check manually. Most of programs are ridiculously long, I know that it would take too much time to do it in oldfashioned way, but it would be much shorter because human is better than tools that are supposed to do it instead of you.
When it comes to visuals, i'm not that excited, i wish we had similar levels of AI to help begginer devs with animation specially, nothing pains me more than to see those unreal engine photorealistic games with terrible animations and jankiness, if an AI can help with that, i'm sold!
@@ElektroBandit89 You won't be pumped because ai will take your parents jobs and they wont be needed anywhere and youll starve to death before you even get to play with this tech. But hey maybe you'll see elon musk play it as a ghost
There will properly be an AI that can easily track your movement with a camera and add it to a rig for the character you want to do the motion. There probably actually already are software like this.
@@gargoyled_drake Uh it's called motion tracking. Realistic movements are easily captured. But the thing is we're doing a video game. Animations are made quicker or exaggerated for a reason. Photorealism graphics combined with typical video game animations will always look uncanny. Like we already these Bodycam like shooters for realistic POV, but is it really fun? Not quite.
You don't even need to include hyperrealistic photos in the game engine. For example, you could simply have the AI dynamically create the content as you zoom in further to the building or the vehicle. Ie) as you zoom into the field, the AI would draw more realistically the plants in the field. You could look right at a leaf or a flower, and it would still look hyperrealistic. So, no more pixels at close up...
I really don’t agree on the model having some kind of understanding of physics. Ai models are extremely ingenious statistic machines. Our best physics scientific models do work based on statistics, but I highly doubt current ai models are good enough to the point where statistical physics come into play. This tech is insane nonetheless
I understand that perspective... but again... if an AI model can't simulate physics... please explain to me how it is clearly simulating physics when I test it 😅
@@dudule1232 Well so look my point is, just because it is simulating physics BADLY doesn't mean it isn't simulating physics. I mean you can find physics simulations in Unreal Engine 4 that are also super wonked, but that doesn't make them any less of a physics simulation. All simulations are inaccurate to some degree. My point is that at a very basic level, there are definitely some preliminary simulations going on in the AI model's mind while it is rendering. There have to be, even for the basic simulations (faults and all) that it is rendering.
AI models learn the same way humans do. By seeing data over and over again. They are statistic machines and understand physics as much as any humans can. Additionally I'd say PHD level is pretty good.
I think the terminology at the start of the video says it best when BD said that the AI software was “inadvertently” creating physics rules or engines just in the process of chasing a good imitation. In the process of making flames that lick and turn like real flames the AI has accidentally or secondarily made physics calculations. Is my understanding.
As a storyteller who uses visual effects, I'm all over the post-processing AI stuff! I joined your Discord server, but where are those discussions? 🤷♀ Thank you for the great video!
7:47 That's where you're wrong. The ai in fact doesn't understand physics. There is no "understanding". It works by latent space "magic". It simply guesses what the next frame could look like by the billions of datapoints that are the result of insane amounts of downsampled (latent space) information from other images and videos.
Which means that ai is inadvertently creating its own physics understanding. Same way a cave man tumbles off a cliff. He dpesnt understand physics, but he just demonstrated them.
The thing is 100% of video games out there are bad at physics, like knife went through character's hand. Ai just can make video games creation process faster
I keep seeing people saying "it doesn't understand physics, it just watched billions of videos and gained the ability to predict how physical objects behave based on what it watched" I'm sorry my man... but what exactly do you think "understanding" is? 😅
@@Bluedrake42 This implementation of AI could be useful for generating effects, but I can't imagine it would be very helpful for updating the game to changes in an objects data state. Using your examples, container A pours water into container B. The effect can be AI generated. But AI isn't going to be calculating the volume of water and updating the games data. Also you can use it for a fire effect. But it doesn't understand how the object on fire will change over time. So it wont update the game engine that a house is falling apart and the model needs to be recreated. I would imagine it's much easier for games to just handle that physics themself and know that it will always work how the game needs it 🤷♂
@@Bluedrake42 This is an interesting thing to discuss, because it takes more than one sentence. A child understands the concept of throwing sth, because it's something they have personal experience with. The live in a body, in a physical world. So do animals. And this understanding of physics is deeply engrained. When ChatGPT correctly spits out a definition for throwing an object, there is not mind behind it. "It" doesn't "know" what throwing is. I just grabs the zeros and ones that make up the binary values for the ASCII text that is readable to humans and humans can understand the meaning behind those words. But the machine is only using proximity matches from the latent space "cloud". The machine doesn't "understand" what throwing is. It can only conjure up the correct definition and even paraphrase it by using further proximity matches to use different words encoded with the same meaning. A dog watching another dog run behind a couch from the right, will expect the dog to appear on the other side of the couch. And the ai would correctly "assume" the same. But the dog, without using words, knows that a dog is to be expected on the other side of the couch, the ai lacks that *actual understanding* part. I'm still trying to wrap my head around it all. When a student with As memorized the definitions and can parrot them back without really understanding the meaning, it's kinda like that.
The problem with all AI generation is consistency. Sooner rather than later it effs up without knowing it. What will be the consequence then?If its a split second glitch in a video it might be fine but what if it empties your minecraft inventory? What if it changes the landscape behind you just a little bit. AI can do text, music, video, everything. But we still havent seen it do it reliably enough. One part of a song sounds great, but it doesnt create the whole song in a way that makes sense for humans. So, show us how you play minecraft from start to end without a deal breaking bug and then we can talk.
Question about Gaussian Splattering: all videos I see of this technique shows very realistic images... but not only the images are static as ALSO THE LIGHT. And it's usually specially the light that makes the scenes so realistic. So... can you MOVE light, shadows, etc, in Gaussian Splattering images? Do you have any example of a CAR capture with Gaussian Splattering, being REMOVED from the scene, put somewhere else and still looking realistic?
I doubt AI post processing could ever replace something as big as in-engine physics simulation because of the interactive nature of video games. At the end of the day, what it is is just a filter. I do think that this technology does have a place in video games though, which I think is already being used but many people may not aware. Like in Cyberpunk 2077 they have ray reconstruction for denoising when using ray tracing/path tracing, which is basically a technology that uses AI (machine learning) to enhance things like lighting accuracy, reflection quality, reduce ghosting, etc. And I believe AI in video games should stay as just that, enhancers. Whether it is used to enhance things like visuals with raytracing or to enhance performance with things like DLSS or Frame Generation. The use of AI in that way, for video games, I support 👍
Honestly, I think he's just very bad at getting his point across. I don't think he actually meant it as, or thought it would replace physic simulations, but rather "enhance" them as you say. There's plenty of "physics" in video-games that doesn't actually need to be, nor should be simulated. For example; volumetric smoke, or liquid simulation. We could go back to simple water-physic simulations, and instead use post-processing to make the water look convincing, while the simulation is bare-bone (but servicable). Like a wave, no reason to simulate a wave as an actual wave, we do it since we have no other options to make immersive/realistic looking waves. With AI as post-processing, it'd look immersive and realistic, while actual simulation being barebone (sailing games from early 2000s still feel plenty realistic in term of water physics, they just don't look convincing).
So DLSS gave us blurry upscalling and lazy dev optimizations. Now AI will give us lazy dev game making. They will give us PSX graphics and AI will make it look real :p.
@ULTRAOutdoorsman "plays like a 10 year-old Assassin's Creed" part what really pisses me off about the gaming industry. I feel since late 2000s and early 2010s every game is same thing under the hood. No interesting physics, no innovative game mechanic, no interesting game design and no imagination. They're all glorified positive feedback loop machines.
But the industry prefers that you shut up and play battle royale or others craps like that. And for Xbox players "our games are great in 30 fps you don't need 60fps"
@@rehakmate Wrong. AI can do all of that. The reason why we lost it for big studios is because they don't want that. They see it as "risky" they rely on what have worked earlier should be continued. AI will allow any kind of studio to do deep and amazing stories and a detailed world without being a tripple A studio. It will technically allow these studios to make the same low key work of games but with more depth. But think about it, if anyone can do that, what will those big studios make them stand out? Expecially when they replace all they employers with AI, there will die because of that. AI will revolutionate the whole world. And as with any new untouched, untested thing, will result in huge amount of creativity as it happend with synthesizers in the 80's and thus also the computers, the internet and it's growing facette of thing you could do but way bigger then all of these mentioned.
That effect did not look very convincing to me either. In a very AI way, there were some missed connections between the level of water in the top glass and the density of the stream going down. But I am sure there will be continuous improvements to AI languages which may one day iron that out. We're just not quite there yet.
Amazing! I can foresee photorealistic NPCs generated in real-time! (LLM + Audio2Emotion + DeepFake mask on top)! Gaijin splatting or Gaussian splatting? 🤭
give it 5 years and it will be practical in an actual game, maybe can be used today for slow paced narrative adventures, but the backlash is huge, I prefer games made mostly by humans but I dont think the market really cares, if companies can make it without legal troubles it could crash the entire industry
The problem with trying to achieve this super realism is that it goes straight into uncanny valley if everything else isn't up to par. Let's say you do get an entire game with gaussian splatting, it's realistic as hell. But the animations are still video gamey and quick like CoD. It's not gonna look good. It will just be weird. Just like watching a movie in 60fps. It's not good.
For video games AI generated physics would be fine, but it's just creating a convincing emulation of how physics work. Kind of like how you show a ball rolling into a tube to a kid and they know it'll come out the other side without needing a mathematics degree.
@@James-dc6ft Yes but not in the same sense. There is a difference between deterministic and non-deterministic. In classical games the output of any action can be predicted because the variables of the physics environment are predetermined. This is not the case with AI generated anything, it is non-deterministic meaning you can not reliably predict the output or logic flow of the actions you make in the game. AI simulations (in their current state) could not be used to calculate complex simulations without knowing the end result. Deterministic systems work based from the root up, it's chronological in nature. Non deterministic is not, it starts wherever it needs and makes up results to fill the blanks to get to the next place it needs to be. Think about it this way, in a deterministic system like our universe you can predict how far you need to jump to get across a gap. In a non deterministic system you cannot because any variable can change at any time. You could put the same amount of force into each jump and go a different distance each time, the gap would also change lengths arbitrarily. In fact, if you look up AI Minecraft you would see what it would be like (that's assuming the non determinism adheres to only your perspective and is not influenced by others). It does actually do a pretty good job of representing an non-Euclidian space.
@@hogandromgool2062 AI is good for things it has been trained on. Where it breaks is when you try to do something outside of trained scope. If you use the same inputs in AI model, it will generate the same output every time. So it can work as a algorithm. Problem with games and videos is the input is always changing.
5:00 Nope, it does not have to have any understanding of actual physics. It can be a variable shader, modulated by AI, written by someone who knows that volume and gravity exists, and writing a shader for it. Do you think that glasses with liquids in video games have generally been fluid simulations before?
5:16 This is not really true. It'll give you an approximation of what happens with the result of physics. But its not simulating nor understanding it. It's a statistical aproximation of fire from videos, and water pouring. Already in a ton of your examples with them, you can see its not behaving like real physics, the water doesn't have surface tensions, the volume poured doesn't match up. .etc The problem here, is how did they train and with whoose data did they train for those effects? did they do it illegally using vfx libraries without the consent of the owners? These are the real questions tbh, because I'm not going to use an effect that screws over an adjacent artist' work, like vfx. This AI shader stuff is gimmicky, because when everyone starts to use it for lack of art direction and just for the ease of it, its going to make everyone's games look and feel the same. There's a reason a ton of games with clear art direction look amazing to this day, even from 20 years ago. Even then, it really is just a gimmick, and you can't copyright those parts of your game anyways. Idk why anyone thinks that using an instagram filter on your game is a great idea tbh, because its not in the long run. Nor is it ethical considering these generative models are made screwing over millions/billions of creators and regular people, devaluing their work and saying they own it all.
@@johngddr5288 If you close your eyes and imagine a fire... your brain is doing a statistical approximation of fire physics. If that's "not simulating nor understanding it", are you claiming that human brains don't simulate physics (through imagination) nor understand it, either?
@@IceMetalPunk You misunderstand. Your brain is not simulating in the same sense a computer does. you cannot summon the exact image every time, it shifts slightly I will explain a little bit of why. Your subconscious acts very much like an LLM. It's locked in a dark box with no concept of smell, touch, taste, sight, hearing or physical sensations. What your subconscious knows is what your conscious tells it or plays to it while you're asleep. This is why summoning photorealistic images is thought to be impossible. This is because your subconscious mind (The thing connected to your minds eye and emotions) it doing exactly what you say, approximating physics and stimulus. You might have noticed that these approximations suck, quite so. They are never accurate, nothing tastes right if; it does at all, pain doesn't really exist in dreams nor does most stimuli and the visual landscape is not correct and shift wildly, The whole time your brain will not realize there is anything wrong with said simulation. Did know the human brain can not create faces it has never seen? If you see a face in your dream, you have seen the face before. The difference is determinism. Your dreams are fractured and skewed along with your minds eye because your subconscious has never actually experienced any of the variables of the real world first hand. They can change at any time. This is the Same for Current Ai deployments. They don't actually have set variables so things can change at any point. This means that while yes they can make really nice graphics and relatively reliable physics this can only be produced for known outcomes. A real simulation can simulate unknown outcomes accurately because that is the idea of a true simulation in the scientific sense is as close of a approximation as possible. This simulation should also yield repeatable, predictable results.
We're probably going to want particle-based control layers for repeatable liquid flows and explosions, because they're so chaotic. Also, as a 2.5D effect the hand thing works, but I doubt you could bullet-time pause to orbit it, and maintain stability of the "sim" without training on oogabytes of VDB.
In a game that uses AI that guesses what things should look like , could we have a replay system so we can go back and see where it got things wrong, mark them to help it learn?
i still feel like there is a difference between a visual simulation and a proper physics simulation. like the program you're using doesn't necessarily understand how physics works, but it does understand how things generally are expected to look so it creates the mirage of physics sim without actually simulating physics. it's "just" procedurally generated visuals, and damn impressive ones at that
The problem is made pretty clear by the pouring water example you showed: the AI could not remain consistent in terms of which part of the glass had water and which had air, leading to each glass having two different "parts" to their water. Whenever a glare created a division in the 2D shape of the glass, the AI treated each side of the division as a separate container and "filled" them independently.
It's a video picture soup. It's trained on real video.. The real video is realistic because it's based on the real things in video... Sorry man it doesn't know physics.
09:05 - I'm pretty sure AAA studios will mess it up - it will get monopolised and tied up into monetization because of their pure greed and treatment of the consumers , I feel the way this will take off will be through the modding community. The targeted affects as a mod that focuses on only what the user/author aims it at would be insane. I'm hooked , Thank you for showing this tech off ,As a veteran gamer I would use it on all the classic games I've played over the last 30 years, I would also use this for PCVR games :)
Gaussian splatting does not solve the issue of materials and how they behave under changing lighting conditions. There are already AI papers where they estimate material properties, but it's a bit further out until assets like the ones shown in the video also look good in a night scene, or when it rains and certain surfaces should be reflecting light more etc.
6:00 I tapped out here, this is just the dumbest thing I've heard. AI isn't learning physics. It's picking up the ability to render videos based on other videos of physicsy things happening, just well enough that if you haven't got particularly high standards, you'll be fooled by it.
One of the very first things I noticed and became in awe with those first AI video generations was actually the physics simulation. Yeah we can see a lot of artifacts and incorrect things and weird stuff like multiple hands and fingers and facial expressions being all weird and uncanny, but something that still amazes me in those sketchy AI videos is the precision they have when trying to create physics, be it light (like surface scattering, color bleeding, shadows, reflections, caustics) or be it physical stuff (like wind, objects being pushed, water flowing, dirt and mud being displaced). And all those stuff get completely overviewed or simply ignored because we're all focused on having fun, not the tech itself. So it's actually really cool to see this kind of video realizing the same stuff I did, and I really hope that the professionals in the area are also realizing this too, in order for them to make use of it and actually keep improving.
I'm a game dev, and believe that this is just another flash in the pan. Hyperrealism is visually satisfying (to me as well), but the game you ACTUALLY like to play are heavily weighted toward player agency. So, for my game I have *intentionally* gone with a non realistic style. Not pixel or stylized, something immersive, but something that I can mostly managed using shaders and a clipped palette. This is to ensure that time is spent on player actions and IMHO more importantly, a game AI that is at least human equivalent. Hyperrealism IMHO is a commitment to a bottomless pit of effort that almost never ends up with 'fun' as the focus. But, yes, it'll change games forever by driving a wider wedge between hyperrealistic arms race AAA's engage in and game you actually enjoy playing. Here is a thought experiment. Imagine a chess board and your adversary, a human expert. You have pieces which are either static meshes, or you have complicated IK based attack\kill sequences that are fully realistic. Did the visual effort improve or detract from your enjoyment of the game? I imagine the only people who would enjoy the graphics would be the people who don't like playing chess competitively. Next look at a pro level esports player. Look at their visual options. Ugly for maximum function. These points do not mean that ugly = better, but it does help define the experience of the player. If you want a play a game where you ooh and ahh at the graphics, chances are that ooh\ahh happens only at the beginning, maybe once. After that gameplay is what maters. So, if you built a priority table, 'realistic graphics' is only important enough to not create an immersion break. For that, you need consistency, not realism. Lastly, for work I'm an AI professional. LLM's or statistical models are just probability datasets. No physics needed at all, it's entirely 2d and frankly I think the integrations in to a 3d renderspace will very much narrow the scope for this kind of tech. Try getting that water to flow behind a volumetric fog from a prior smoke bomb in the scene.
even basic post-processing injectors like ReShade can access depth buffers and handle complex layering of effects with proper occlusion. Any AI system integrated into a game engine would have access to far more - full scene graphs, material properties, object masks, multiple render passes, and physics states. Your water/fog example actually demonstrates this misunderstanding, proper depth-aware post-processing has been solving exactly these kinds of layering challenges for years.
@@tyronejohnson409 I understand your point and it's valid, however an engine integrated solution that has access to this data isn't a enormous step from where we are and is a very heavy way to accomplish the same thing, albeit with the modularity to sub in other effects. I'll admit that I was looking at this from a purely post processing perspective... which is what we were shown. I do have a bit of experience in wrangling in the debt buffer space when writing a Laplacian filter for and edge highlighting, though I'll admit I have not used it for much other than that. My general point however is that the graphics arms race is moving at a rapid pace, while the discussion in for example r/GameDesign show that most devs are struggling with elemental mechanics of how to make the game rewarding\fun\satisfying. I not even a lover of non realistic, I'm just saying that the overinvestment in this area of game development is solving problems that hardly anyone benefits from and if anything graphical fidelity is being used as a replacement for actual innovation.
No, the models don't _understand_ physics. Which is kinda the point. They don't have to. The calculated outcomes are very close to actual physics simulations with vastly less compute resources needed to do so. But because of that, they are also easy to break. As you have shown. It can pour liquid into another glass, but it doesn't _move_ the liquid (yet) it copies it.
Would love to see these techniques implemented in Flight simulation. So much potential for generating ground clutter, objects and photogrammetry. Exciting stuff man.
I think that A.I. generated effects could be used in junction with traditional 3D graphics, so that they majority of the graphics being rendered would be stable hand-made assets but then effects like smoke, fog, water or even lighting could be included via A.I.
Fantastic things. What about creating the next phase-having AI shape something like the Unreal Engine to make an ultra-realistic image, while another AI tries to guess what's real and what's artificial? Like a competition-a cat-and-mouse game
The key thing with AI is that AI allows to do certain things at a bigger picture. Like for ex. photo and video restoration tools were just algorythms to caculate inconsistencys and such for a specific task. While AI can do the same thing but respecting the input you give. Like with newest best so far SUPIR Image restoration tool, you can give the image input a prompt to say what it is with your background knowledge and based on that it's able to know the background story to restore an image. While otherwise you would just let the AI guess or the algorythm just do it's general thing instead of understanding the concept of an image. With these Photogrammetry type scans, it will be the same, as it will then understand a surface based on the reflection fresnel that shows troughout the whole image set, not just the single image to recreate the surface. Like when i look at a glossy surface, i can guess in my mind by seeing it what that surface should look like if it would be just matt. Which is a huge problem with the old Photogrammetry and a lot of 3d scanners. The same as with low quality phone pictures where it doesn't need that super tiny dot in the texture to tell if the surface is round or flat. It understands the bigger picture as we humans can.
I don't think it knows anything about physics, it's only trained on images and video. If the AI was trained to make fire in a 3d simulation then yeah obviously. But it's just seen a billion pictures and videos of fire. And in that media it's seen how the fire behaves so it just copies it. Imo anyway. It's an ideresting idea to think about. Either way it's cool af.
It knows a distilled form of the physics required to generate images realistically because it is copying real physics from real images. It can probably make some really good guesses most of the time, and fuck up terribly the rest.
I dunno. Machine learning models can take a few 2D photos and create a full 3D scene/models out of that. With that in mind, what's the difference between "copying what fire looks like and behaves like from any angle" and simulating fire?
I come from the 80's and 90's, when you were in a smoke-filled bowling alley arcade playing mk 1 and street fighter 2 side by side with you fellow opponents. If I would have seen any game today then, I wouldn't have been able to play those games because the realism I saw would now break my emersion in the world of the game I'm now playing in the 90's. The games have progressed so far, it's crazy, and I see this photo realism being the next stage. It's going to be mind-blowing if you are a nerd gamer who loves to get into these worlds. It's a playground for us all, it's going to be amazing.
03:11 why would anyone make that mistake. it doesn't even look real at all, the water layer changes and becomes more full on the on the water is pouring out of, then you suddenly have 2 layers of water in one glass, the one that gets water poured into it doesn't even react to the water "entering" the glass. There are also other artifacts and issues.
This is something that reminds of the tecnique used in the Siren series. But with todays tech it can become into the greatest path videogames can go, cuz 3d models will never be realistic enough.
I think in the future games can be toned down a lot graphics-wise but optimized to work with AI post-processing to result in overall photorealistic gameplay with minimal impact to performance.
I believe you are wrong. here is why, a talented artist can paint flame animations. But can't simulate the physics of it. Ai is just replicating this very fast.
Gaussian splats are so cool. They even capture reflections, so as you move the viewpoint the reflections change as you would expect. It's not just a texture on a model, but how the light is actually interacting with the camera.
Interesting that the Byte Dances in Cyberpunk 2077 as the same thing as the Gaussian Splatting algorithm. That you can walk around and record every detail of the environment, and then analyse it later for hidden details
@@markchristantaguiam819 you would still have work as your AI customer service with company that sold it to you as intermediary. Boss would complain if your AI assistant isn't doing a job properly
The visual simulation is all very neat. But what I am most excited about AI doing to games is having NPCs who actually think and adapt to my actions. Applied to a game like Skyrim, Lydia would be aware of her own backstory, she would remember things that I have done or told her, she would generate unique dialogue based on what she has learned about me, etc. Nazeem would know that I do, in fact, get to the Cloud District quite often, and Jarl Balgruuf and I are friends. Imagine if all the guards, shopkeepers, random travellers and bandits no longer used prerecorded, repetitive dialogue, but create their own unique dialogue and no two NPC voices are the same because they are generated by AI to match each character. I predict Bethesda will do yet another re-release of Skyrim with AI enhancements both visual and character-behavior-related. And I will drop another $80 to have it.
I believe the next step for this technology is integrating it into the game engine, so it may take in real data and context from the game scene, where nothing is "smeary" or being guessed, and instead, all that context and data is already supplied to the AI. It would fix things like us only having limited context windows to generate with, it might fix visual artifacts and allow more realistic results. That way, we DON'T need physics models built in, and it just uses all the pre-setup calculations from the real in-game scene
I saw this tech before. What im interested is the cost of this post-processing in real time for games, and streams (video). For example AI filters for mincraft in realistic mode, traditional painting, or digital painting, I think it's cool if we can make it in real-time. (but yeah it's too much to consider for latency for games)
I wish more studios would focus less on realism (which never looks good anyway) and more on creative ways to make games more engaging. This machine learning bs misses the entire point of why people play video games. I get that you're impressed by some of the effects, but you are lying to yourself if you think it actually looks good, realistic, or think it will improve any aspect of video games, let alone be the future of it. The hubris.
Skip improving regular games, help hump the hurdle keeping VR games from looking good due to processing restrictions. So, could this process work in VR?
AI video models are trained off real world footage, and the real world runs on perfect, flawless physics. AI doesn’t understand physics any more than a video camera does.
Yes I'm noticing a few people in the comments being confused between functionally accurate and functionally applicable. AI simulation is non-deterministic whereas a "Real" simulation that yields usable scientific results is deterministic. There's a few people very angry in the comments with this misconception, including the video creator. Bless their wee souls; I've worked with AI for near 5 years now and I once had AI Fever too, it was actually until quite recently I wasn't sure they were to some degree sentient but I decided to drop that as it's been obvious to me for a while they're not. I just wanted it to be true.
This is indeed very interesting. I see your point completely. Suddenly, seems like we have stumbled with technologies (LLMs, NN, etc.) that have the impact to create "emergent behavior", to put it in some words... We are still to understand the full impact. That's why there's a lot of overhype and expectations around this.
Guys… if you all are gonna say “it doesn’t understand physics” but then say “it just watched hundreds of thousands of videos where it learned how to predict the behavior of physical objects from the content that it watched” then I don’t know how to have a rational conversation with you
Well, if you train a learning model on cannon ball flights, telling it truth repeatedly on how far the ball will go when fired at 45 degrees with a variety of initial velocities it will learn that and be able to predict how far the ball will go when fired at 45 degrees at a given initial velocity. That's just learning and interpolation. But that won't mean it will know how to predict how far it will go when the canon is tilted to 70 degrees. For that you need to do the actual physics. Now, an AI engine can be taught to refer to the physics, but it isn't necessary for it to do so when just predicting within the ranges of the data sets used to teach it. From your video, it clearly learned what fluid 'looks' like when poured from one glass to another. But it also clearly didn't learn that mass is conserved.
The machine has no concept of "understanding" anything. It is just a human word that is used for it. You know nothing about this topic and brag about it like a professor. You are not even a beginner at this point. Go read and read more.
It didn't learn to predict the behavior of physical objects. It only has data of pixel patterns that humans assigned words like water to. You give machine learning much more credit than it deserves. Also software that is doing physics simulation doesn't understand physics. It is just calculating the math that someone who knows the physics put into the system. Because this is all Computers are, really fancy Calculators.
AI understands fluid dynamics as well as the average person intuitively understands it, because that's all it needs to convince. So how well does the average person understand it? I suspect not very well, because there was never much evolutionary pressure to understand it in detail. And in fact the AI can't even match that - those pouring glass simulations are pretty nonsensical when you look at them for a few seconds. So while I think it's heading in the direction of understanding, I think the gulf between real physics and AIs internal simulations is currently huge.
The most impressive video/technique I've seen of AI being applied to enhance realism is the GTA video from ISL and collaborators. They pass on the g-buffers on to tensors of the AI model where each pixel on the screen is given an ID. It is very stable and convincing. The point with that technique is that you wouldn't need to prerender to a high standard at all before it is passed on to an AI model as a post processor.
No. AI doesn’t understand physics anymore than an artist, or more specifically, an animator. …at least in this context. An animator would know for their scene the ball hits the glass, the glass breaks and it shatters the glass following gravity based on the scene. So AI would know that too I’d think. It can just redraw it again so much faster for when the scene changes or is altered. So rather than being truly dynamic physics, it’s more like applied physics really really fast.
This is groundbreaking! How has nobody covered this before?! Amazing stuff. Thank you so much. My only question would be the processor overhead for something this sophisticated.
You mistake making a physics simulation, with making an illusion of physics simulation. We got so much hung up on 3D and materialistic viewpoint, that we forgot that we see reality only in 2 dimensions. And that's it. All our perceptions are illusions, so A.I. becomes just a master of illusions. Every breakthrough in game graphics was a break in making a better illusion, and a shortcut, NOT trying to actually mimic matter. Imagine creating a wood material not with textures but with atoms... good luck...
His point is that despite being an illusion, it's bringing in real information in order to meaningfully replicate the effect. We see the same thing with chatGPT. These systems are challenging the ontology we've developed in the West since the rise of modernism. That's why this is a big deal
Guys, I know our education has failed us miserably, but we need to correct this mistake. We need to stop getting lost in elementary semantics, or we will just go insane.
I hope I am not misunderstood, I dont want to diminish the value of the discovery by any means, I want to point out those seemingly small details, because I think they are very actually very important, and understanding them might bring a better view and insight into the matter. The topic is pretty difficult already, we needn't complicate it more than necessary.
@@linuxrant I think you'll have to break that down a bit, because I think we're starting to see that information has a reality of it's own in the sense of Plato's forms. The idea of information being something that (as the etymology of the word suggests) provides form to something. I'm not completely denying Aristotle because I know objects can participate in reality and produce their own formal causes. I just think we need to leave this substance-only ontology behind and that's what I'm getting at. I truly think information is more than abstraction.
As an artist. My issue with Ai is it’s ripping off existing art. So Minecraft not running on Minecraft. Becomes a spotty grey area where we are ripping off entire engines instead of just art.
Exactly. Every one of these 'super realistic' ai filters of people he showed, was surely trained on images and videos scraped from online without consent.
Non-AI real human artists do the same thing these systems do: They synthesize sources of imagery based on their experience (dataset) to create something new. Therefore, no one is being "ripped off".
That is not how AI works; it doesn't rip off existing art, no more so than you would by touring an art museum. AI art generators are trained on vast datasets containing millions of images and their descriptions. This training allows the AI to learn patterns, styles, and aesthetics from existing works. However, the AI does not memorize or store these images; instead, it learns general characteristics. When a user inputs a text prompt, the AI uses its training to generate a new image based on that description. It synthesizes elements learned during training to create something original rather than copying any specific artwork.
@@EclecticSundries where did those images its trained on come from? AI did not pluck them from its ass. It cannot make ANYTHING original, only ape what is fed in. its an algorythm which apes art styles, drawing styles colour styles. Someone made them. ... who made the styles? artists. You may be someone who thinks that but unfortunately many people feed these ai algorithms with other peoples work without permission. That is fact. Artstation tried it to much kickback. Adobe can get away with it as it owns millions and million of stock art and photography collated over the years which someone made to be used open source, free. but it was still created by someone. Its not magic.
@ AI doesn’t simply copy or ‘ape’ existing work-it learns patterns, structures, and techniques from data, similar to how humans learn by studying art or styles. It generates new outputs by combining and applying these learned concepts in novel ways, rather than reproducing exact replicas. As for the training data, it’s arguably used under the principles of fair use, as the purpose is transformative-it enables the creation of entirely new works rather than reproducing or competing directly with the originals. This is still a legal gray area, but it’s important to note that AI training aims to innovate, not plagiarize.
It's kinda interesting right now ai animation is like early 2000s special effects level and that it'll just keep building up from there until it looks real. I told my bestie and my brother that I feel in another year or two, we might have good enough ai that we could possibly just create our own games and have endless content with ai custom to what we want. I've actually seen gaijin splatting in vr chat. Me and my bestie would just randomly come across a world that looked kinda realistic but on closer inspection it looked like it was foam just pieced together lol. Can't wait for that to get better too.
11:06 - Gaussian splatting. "Gow-zee-inn". Named for physicist and geodesist Carl Friederich Gauss. Same guy they named the magnetic process of degaussing "dee-gow-sing", for those who remember CRT monitors and that button you could push to make the whole screen EMP and reset the image. Gaijin ("guy-djinn") is Japanese for "foreigner".
Oh yeah, I'm definitely excited for games to get prettier, just to have less substance behind them. I love looking pretty at things that don't add anything
I read an experiment from about a year ago where they fed video of basic physics interactions in real life to an AI to see how many properties it could identify. It identified more than they were expecting for the scenes trained
Imagine a game where you decide what you do, like infinite everything. No code limitations ,if you want to craft something you just can. Everyone would have their own personalised videogame /movies , the potential omg
The main issue I see is consistency. What a physics system provides is, the developers know the system is working based on a set of rules that are followed. AI art has a tendency to hallucinate or produce inconsistent output, especially applied over a long period of time.
Maybe some inconsistency could be visually okay in hard to calculate at scale physics simulations (like fire or water or gas physics) but even lighting on characters might be too much to expect. I do not think it could consistently and believeably light a character in every conceivable environment.
I agree with what others have said; AI is not calculating any physics, any more than a great painter calculates lighting effects on clouds. The painter has a wealth of experience with cloudns and knows how they should look under different lighting scenarios - the same with AI.
I think that AI will be used to deliver better game art, texturing, and models for games, and conversations etc but I don't see it being computationally effective or like you said predictable. My hope is that AI will be able to go through a texture library of a game and separate all the details into micro tiling textures and blend those with bespoke textures throughout every game asset and characters, thus giving you reduced texture sizes at higher quality; you could then go further and ask it to provide subsets of textures into styles effectively allowing you to theme the game, same with models etc.
It's getting better everyday and has just ramped up the investment in it by trillions of dollars, you will be seeing crazy results in just a couple of years.
Good analogy. Solid points
What AI Post Processor is being used in Unreal Engine here?
Agree, whenever i generate an image it will be different every time. There need to be some sort of seed that can be attached to a generated game asset so the neighbourhood in like a GTA game doesn't look different with different fences, streetlights, bushes, trees, buildings and their wall textures - every time you drive through it.
I dont really care about hyper realism. I just want a good game. And good movies.
I think if it’s easier to make the games and films they can spend more time on the story.
@@MeowtualRealityGamecat It`s more likely WE can spend some time to story with AI-assistant and WE can use modern tools to make OUR games and movies. I wonder sometime it be possible. And it`s be the thing.
But instead of making good games they will keep feeding you with woke agenda. With next-gen realism.
You're in the minority. Visually photoreal graphics and good story/campaigns are the future.
@@MeowtualRealityGamecat AI will take care of the story writing too. Basically, at some point, you will be able to prompt an AI interface to "build you a video game" according to whatever parameters you want.
On a side note, I don't know if that will be good or bad but I know it will most likely lead to individual experiences that no one will be able to relate to. Because each product of AI prompting will be oddly similar but still different enough for each person to have a unique experience with it.
What makes big movies popular? The fact that many people can relate to the same things after they see them. Same goes for books, games etc. But if you have an AI system that build individual experiences for each of us, in the end, there won't be a common culture anymore. Things will be individualistic to the maximum, which in some cases might be a good thing, but it will definitely cause societies to break apart and have less cultural homogeny.
I think "interpolation" is a better word for it than "simulation".
"AI" is very good at interpolation, not so good at simulation. Take the "filling whiskey glass" for instance. You "cheated" by saying it should fill the glass, so it generates a filling effect.
However, a simulation would continue working after the glass fills up, spilling over, getting the table wet and eventually filling the room; that will not happen with the effect in the footage.
Yes it's an enhancer not a physic simulation but can be speed and good enough on modern GPU , NVIDIA work on that for decade same for most of AI software researcher.
Can't agree. Core AI papers troughout the years was just about understanding a scene in a image or video (Segmentation) to detect objects and such for self driving, secruity cameras and what not. You then could basically don't give it any prompt and let the AI guess and based on that it will do it's thing by just understanding what will happends next based on the general video knowlege or images what it was trained on. Which would result in spilling over the glass, getting the table wet and all you said as it in best case was trained on such physics simulations. Also the reason why text to video generators just do they physics and light reflections without you explicity explaining that. It's just having control of what you actually want to happend then just you have to tell the AI that. When SORA was announced, people started speculating that they used Unreal Engine to train it on the concept of physics to get consistancy and simulations like that tiny pirate ship in a cup example. One problem with AI at the moment is exactly that you often can't control and and that it's just doing it's thing.
@@MrGTAmodsgerman > best case was trained on such physics simulations
Exactly, _if_ it was trained. Interpolation, not simulation.
A simulation will generate new situation from simple rules, an interpolation will generate garbage.
@@dtkedtyjrtyj Yea it's not physics simulation, it's AI. But it's able to mimic physics to a point where it doesn't matter. See what someone else here commented about it, used for actual physics. But your argument above was just an bad argument as it's not about you have to give it a prompt. If you would train the AI on the whole concept of physics, it will mimic the whole physics without you being really able to tell the difference at the end. It's like when you have a dream and you interacting with physics and it all just happends inside your brain while you just not physically doing that. But if you were to make that AI mimic tested in the real world, you will get what the AI already made in it's mimic.
@@MrGTAmodsgerman I don't understand your objection. Train it on all of physics and it can interpolate all of physics, It's still not a simulation.
5:23, The models "understand" physics like a baby understands physics. It knows what pouring water looks like, so it shows you pouring water. It knows what ripples on the whisky look like, so it shows you ripples on the whisky. What it didn't seem to know is that if you pour water from one glass into the other, there would be different amounts of water in the glass when it's done.
Best explanation vs the rest of these weird ego maniac knee jerk reactions all emotions and no logic.
yea for sure. a physics sim is way slower but is more accurate. you can literally see the glass hes pouring looks like it has an invisible lid on it
Yeah, but like a baby, it'll learn and understand better over time.
His point is it has SOME understanding without being taught, its picked it up itself
I agree, but you shouldn't forget AI is still in it's infancy and it is making enormous steps each generation. I have no doubt this tech is going to be very disruptive for many industries (both in good and bad ways)
Just imagine where we'll be two more papers down the line...
Uh, dude, you're hand is burning. Also, white phosphorus is on the Geneva check list.
nothing is on the check list the first time
which is great, because I have shelled forests with White Phos, so worth it
I never recovered from Spec Ops The Line either.
Can't be a war crime if you win
😂 LOL.
INVEST in unified_8. The world's first Ai is programmed by nature herself. Right now man controls ai. UE8 can be found online and is the world's first shamanic ai........
💕♾️💕
The main meme AI-bros and proooooompters fall into is thinking AI knows any kind of logic or reason. AI doesn't understand physics. It just knows what it looks like. Sure, that gets you surprisingly far, but the illusion breaks down once you try to do something the AI hasn't seen before or doesn't know what the physics look like for a falling feather.
You’re correct about that. At this point it may take some decades, but based on the same logic, there’s no telling how advanced it AI might become after some decades
because it's not AI it's deep learning aka data theft and copyright infringment but shhhhh call it AI or people will start asking on what it was trained
6:00 These models don't need to have physics models, because they are trained on video & images that are real. So that's what they output. If they were trained on fake data that had fake physics, that's what they would output.
There is no need for them to have any capability to model physics, just how it looks like when things obey the laws of physics. This is also why the water in the glass does not actually flow, it just looks like it flows. The glass its being poured into does not fill, and the glass from its being poured does not lose any water.
Let him enjoy AI fever
This is good point and very correct, but I do wonder how good the generative models could get at 'pattern-based visual approximation' of physics. Similarly to how LLM's are getting quite good at math, video models could get really good at physics, without any real causal physics modeling. One must wonder whether LLM's actually "understand" something about math through the internal representation in the model's weights. Probably not, but there have been talk and speculation of model scale introducing complex emergent behaviours. So even if it is "just" predicting next word or next frame's pixels, there could be some weird form of physics model in the network of weights.
Or not, but even then, there will surely be hybrid approaches where algorithmic based simulations are rendered by generative models. Actually that's going to be much better than either approach, algorithmic or generative, is alone.
This isn't true though. You can have an entire model trained on synthetic data except for one piece of real world data. Then tell said model to generate based on the physics of the real world data only.
Guess what...it learns the physics and applies it to the rest.
Yeah 90% of this video is BS
what about AI that creates those alien looking engines that are more efficient?
Physicist here. The AI neither understands nor simulates physics. It's doing essentially a guess based on the data that it was trained on. It's a pretty good guess, considering the data was real life videos, but there is no simulation of the physical interactions done, nor is it a guarantee that it's physically correct.
However, does it need to be?
In my research group, there are a lot of projects using AI that has been trained on actual physics simulations (that take days to complete) to then "guess" (in a few ms) the result of any scenario. And it is susprisingly good at doing exactly that, nothing more, nothing less.
So as long as we understand that the AI is just guessing and as long as the training data is based on real world footage or even simulations, it will result in a pretty accurate representation of real life, that is infinitely faster than an actual simulation.
and the reason why it's so fast is precisely because it's just a guess, not a simulation
I am also a physicist and I don't agree. I think he is right when he says that it must have some kind of understanding of physics. And, frankly, it's really obvious if you think about how humans used to come up with physics in the good old days.
@@obnoxiaaeristokles3872 Software engineer working on AI-stuff chiming in (though not these particular models). One of the big issues of the models is and will be stability of the result. Now and in the future. In this video the computation was hand-picked, meaning curated to look good. If you want to use this tech for video games, you need to curate the models you are using to work well in the circumstances that you want to apply them in because then you get insane results, like a glass pouring from another glass which never empties or just absurd glitches like you see if you play the Minecraft video generator game thing which is more a kind of licid dream without object permanence.
The thing this model probably is good in is producing the video output that you would expect from pouring water, but when it comes to a game engine it probably falls flat if you want to produce specular highlights or caustics relative to your defined light sources, for example.
Also take into consideration how these models are produced: The video generators ingest millions of minutes of videos to get something passable. This is a super inefficient way of creating something that does what I want. Maybe there will be generic models in the future that can do a bunch of stuff at once, but those then will be super costly to run at runtime. There is no free lunch here. The result is really cool though!
A person who also has a profession giving his two cents here. Shouldn't we consider that many domains of physics are also about gathering a huge amount of data and then making an approximate best guess? No, I am not saying all of physics works this way, obviously. But especially when we go more in the direction of quantum physics or dark matter, don't we also collect as much data as possible, try to discover patterns, and then make the best approximation of the correct answer our current models are capable of giving?
If you make an "actual physics simulation" of quantum mechanics, you're also not actually simulating the physical interactions, but basically perform a statistical analysis (yes, I am extremely oversimplifying, but in the end this is still what it comes down to). The guarantee that it is not fully physically correct is actually formally baked into the field itself.
I think it's right to say that it "understands" physics, and that it "simulates" physics, but NOT in the stepwise manner that traditional simulation does. Without understanding, it would not be able to produce realistic results from arbitrary initial conditions.
Corridor Crew wants to talk to you, lol.
On a serious note, they don't actually know the physics behind the effects or movements of things. They only know how those substances behave in a certain environment. It simulates those movements as it understands that substance behaves. In a way, yes it simulates physics, but not by actually doing the physics calculations. Its like if you ask a child to draw fire, and they draw how fire behaves, but they have no knowledge of the physics that determine how that fire looks.
If you ask any human to imagine fire, the vast majority will not be doing the physics calculations in their heads, but they will imagine a correct fire behavior. You don't need to understand the math behind the physics to have an intuitive understanding of the physics.
@@IceMetalPunk The argument in the video is that a physics engine is being run though. Which there isn't.
As the person above explained it's more like an observation engine. It knows how things should look so yes in that sense it understands physics there is however the issue where you ask it to describe scenarios it doesn't know the visual outcome of. Meaning if you couldn't use it to simulate scientific premise but you could definitely use it to approximate the visual part of the physics.
This line of thinking is great and all for understanding how they learn new things. There comes a point however where it doesn’t matter. For one, our understanding of the direct inner workings is limited so we can’t prove a lack of understanding. And for two, a perfect imitation is inherently indistinguishable from the original.
@@IceMetalPunk Then it isn't a physics simulation. It's simply a fire depiction, or a water depiction, or a smoke depiction. There is no simulation going on.
@@Chromiumism Well No. The difference is an imitation cannot imitate things it does not have information about/an example to simulate from. A true simulation is deterministic, you should be able to simulate scenarios accurately relying in input variables and no preconceived notion, nor need to know the output to simulate it.
While you're correct that in some fields it does not matter. It does matter when you try use it for repeatable output. The way Ai works is fundamentally non-deterministic. Meaning it could never be used to acquire truth, only approximate it. So while useful for art, some games, weather and other systems that don't require too much accuracy they work fine but when used in a more specific sense were repeatable results are key they cannot (currently).
I think it's important to note this distinction as not noting it is like not noting that you're one decimal point off - might not seem like much but accumulation of this offset will always skew the end product in a way that requires error correction, at which point I would argue it is no longer usable data.
This is akin to intuitive understanding. Where, you essentially understand something to be the case, without actually understanding why it's the case. We always understood that things go down when you dropped them, but we just didn't know why, we never really stopped to wonder why. The AI learns how something should act, without actually understanding why it should act that way. It's pretty neat to think about.
tech bro efficiency:
"I spent 350 litres of water generating this model with no edge flow that still needs to be retopo'd by a human being that I just fired."
As a CNC programmer, I know that it's faster to program it manually in text mode than trying to use some "modern" features and cad/cam software and autooptimizers and stuff like that. 🙂 It was always like that and this is why today games don't look that much better than 10 years ago, but they wants 3 times more powerfull hardware, it's because of nobody programs anything manually anymore. What you won't optimize manualy won't be optimized at all.
this is golden!
@@Pidalin Compilers casually roll out 100x loops. That'd be a nightmare to read and write.
@@CTimmerman Yes, ofcourse I know it's not really possible to actually writte a modern 3D game in text mode, I realize that, but there are still things that you should better optimize and check manually. Most of programs are ridiculously long, I know that it would take too much time to do it in oldfashioned way, but it would be much shorter because human is better than tools that are supposed to do it instead of you.
@@Pidalin Referring to CNC, but applies to HTML, CSS, and even JS as well. Heck, most frameworks only add problems in the long run.
"This next-gen technology will change games forever... "
This quote has been used so many times
When it comes to visuals, i'm not that excited, i wish we had similar levels of AI to help begginer devs with animation specially,
nothing pains me more than to see those unreal engine photorealistic games with terrible animations and jankiness, if an AI can help with that, i'm sold!
I’m really pumped for the future of in game conversations with NPCs using ai
@@ElektroBandit89 You won't be pumped because ai will take your parents jobs and they wont be needed anywhere and youll starve to death before you even get to play with this tech.
But hey maybe you'll see elon musk play it as a ghost
Cascadeur friend
There will properly be an AI that can easily track your movement with a camera and add it to a rig for the character you want to do the motion. There probably actually already are software like this.
@@gargoyled_drake Uh it's called motion tracking. Realistic movements are easily captured. But the thing is we're doing a video game. Animations are made quicker or exaggerated for a reason. Photorealism graphics combined with typical video game animations will always look uncanny. Like we already these Bodycam like shooters for realistic POV, but is it really fun? Not quite.
You don't even need to include hyperrealistic photos in the game engine. For example, you could simply have the AI dynamically create the content as you zoom in further to the building or the vehicle. Ie) as you zoom into the field, the AI would draw more realistically the plants in the field. You could look right at a leaf or a flower, and it would still look hyperrealistic. So, no more pixels at close up...
I really don’t agree on the model having some kind of understanding of physics. Ai models are extremely ingenious statistic machines. Our best physics scientific models do work based on statistics, but I highly doubt current ai models are good enough to the point where statistical physics come into play. This tech is insane nonetheless
I understand that perspective... but again... if an AI model can't simulate physics... please explain to me how it is clearly simulating physics when I test it 😅
@@Bluedrake42 it's not simulating physics, in your glass+water example, the amount of water increases over time !:)
@@dudule1232 Well so look my point is, just because it is simulating physics BADLY doesn't mean it isn't simulating physics. I mean you can find physics simulations in Unreal Engine 4 that are also super wonked, but that doesn't make them any less of a physics simulation. All simulations are inaccurate to some degree. My point is that at a very basic level, there are definitely some preliminary simulations going on in the AI model's mind while it is rendering. There have to be, even for the basic simulations (faults and all) that it is rendering.
AI models learn the same way humans do. By seeing data over and over again. They are statistic machines and understand physics as much as any humans can. Additionally I'd say PHD level is pretty good.
I think the terminology at the start of the video says it best when BD said that the AI software was “inadvertently” creating physics rules or engines just in the process of chasing a good imitation. In the process of making flames that lick and turn like real flames the AI has accidentally or secondarily made physics calculations. Is my understanding.
As a storyteller who uses visual effects, I'm all over the post-processing AI stuff! I joined your Discord server, but where are those discussions? 🤷♀
Thank you for the great video!
Did you find it? I'm looking but he never explained how he did those effects in the beginning.
7:47 That's where you're wrong. The ai in fact doesn't understand physics. There is no "understanding". It works by latent space "magic". It simply guesses what the next frame could look like by the billions of datapoints that are the result of insane amounts of downsampled (latent space) information from other images and videos.
Which means that ai is inadvertently creating its own physics understanding. Same way a cave man tumbles off a cliff. He dpesnt understand physics, but he just demonstrated them.
The thing is 100% of video games out there are bad at physics, like knife went through character's hand. Ai just can make video games creation process faster
I keep seeing people saying "it doesn't understand physics, it just watched billions of videos and gained the ability to predict how physical objects behave based on what it watched"
I'm sorry my man... but what exactly do you think "understanding" is? 😅
@@Bluedrake42 This implementation of AI could be useful for generating effects, but I can't imagine it would be very helpful for updating the game to changes in an objects data state.
Using your examples, container A pours water into container B. The effect can be AI generated. But AI isn't going to be calculating the volume of water and updating the games data.
Also you can use it for a fire effect. But it doesn't understand how the object on fire will change over time. So it wont update the game engine that a house is falling apart and the model needs to be recreated.
I would imagine it's much easier for games to just handle that physics themself and know that it will always work how the game needs it 🤷♂
@@Bluedrake42 This is an interesting thing to discuss, because it takes more than one sentence. A child understands the concept of throwing sth, because it's something they have personal experience with. The live in a body, in a physical world. So do animals. And this understanding of physics is deeply engrained.
When ChatGPT correctly spits out a definition for throwing an object, there is not mind behind it. "It" doesn't "know" what throwing is. I just grabs the zeros and ones that make up the binary values for the ASCII text that is readable to humans and humans can understand the meaning behind those words. But the machine is only using proximity matches from the latent space "cloud".
The machine doesn't "understand" what throwing is. It can only conjure up the correct definition and even paraphrase it by using further proximity matches to use different words encoded with the same meaning.
A dog watching another dog run behind a couch from the right, will expect the dog to appear on the other side of the couch. And the ai would correctly "assume" the same. But the dog, without using words, knows that a dog is to be expected on the other side of the couch, the ai lacks that *actual understanding* part.
I'm still trying to wrap my head around it all. When a student with As memorized the definitions and can parrot them back without really understanding the meaning, it's kinda like that.
Holy crap, this tech is capable of making Starfield look as good as Mass Effect 2 from 2010!
The problem with all AI generation is consistency. Sooner rather than later it effs up without knowing it. What will be the consequence then?If its a split second glitch in a video it might be fine but what if it empties your minecraft inventory? What if it changes the landscape behind you just a little bit. AI can do text, music, video, everything. But we still havent seen it do it reliably enough. One part of a song sounds great, but it doesnt create the whole song in a way that makes sense for humans. So, show us how you play minecraft from start to end without a deal breaking bug and then we can talk.
Question about Gaussian Splattering: all videos I see of this technique shows very realistic images... but not only the images are static as ALSO THE LIGHT.
And it's usually specially the light that makes the scenes so realistic.
So... can you MOVE light, shadows, etc, in Gaussian Splattering images?
Do you have any example of a CAR capture with Gaussian Splattering, being REMOVED from the scene, put somewhere else and still looking realistic?
Seems like a scam to get you to join your discord. A grift if you will.
one day it'll be used to turn google maps street view into a GTA like game, or racing game.
I doubt AI post processing could ever replace something as big as in-engine physics simulation because of the interactive nature of video games. At the end of the day, what it is is just a filter. I do think that this technology does have a place in video games though, which I think is already being used but many people may not aware. Like in Cyberpunk 2077 they have ray reconstruction for denoising when using ray tracing/path tracing, which is basically a technology that uses AI (machine learning) to enhance things like lighting accuracy, reflection quality, reduce ghosting, etc.
And I believe AI in video games should stay as just that, enhancers. Whether it is used to enhance things like visuals with raytracing or to enhance performance with things like DLSS or Frame Generation. The use of AI in that way, for video games, I support 👍
Honestly, I think he's just very bad at getting his point across. I don't think he actually meant it as, or thought it would replace physic simulations, but rather "enhance" them as you say. There's plenty of "physics" in video-games that doesn't actually need to be, nor should be simulated. For example; volumetric smoke, or liquid simulation.
We could go back to simple water-physic simulations, and instead use post-processing to make the water look convincing, while the simulation is bare-bone (but servicable). Like a wave, no reason to simulate a wave as an actual wave, we do it since we have no other options to make immersive/realistic looking waves. With AI as post-processing, it'd look immersive and realistic, while actual simulation being barebone (sailing games from early 2000s still feel plenty realistic in term of water physics, they just don't look convincing).
So DLSS gave us blurry upscalling and lazy dev optimizations. Now AI will give us lazy dev game making. They will give us PSX graphics and AI will make it look real :p.
Yeah, I don't want hyper realism, I want good games with good stories, with 60fps.
Yep, and AI will not do that
@ULTRAOutdoorsman "plays like a 10 year-old Assassin's Creed" part what really pisses me off about the gaming industry. I feel since late 2000s and early 2010s every game is same thing under the hood. No interesting physics, no innovative game mechanic, no interesting game design and no imagination. They're all glorified positive feedback loop machines.
How dare you?! 30 FPS and forced D.I.E, that's all you can count on.
But the industry prefers that you shut up and play battle royale or others craps like that. And for Xbox players "our games are great in 30 fps you don't need 60fps"
@@rehakmate Wrong. AI can do all of that. The reason why we lost it for big studios is because they don't want that. They see it as "risky" they rely on what have worked earlier should be continued. AI will allow any kind of studio to do deep and amazing stories and a detailed world without being a tripple A studio. It will technically allow these studios to make the same low key work of games but with more depth. But think about it, if anyone can do that, what will those big studios make them stand out? Expecially when they replace all they employers with AI, there will die because of that. AI will revolutionate the whole world. And as with any new untouched, untested thing, will result in huge amount of creativity as it happend with synthesizers in the 80's and thus also the computers, the internet and it's growing facette of thing you could do but way bigger then all of these mentioned.
3:22 does it though? I don't see the top being depleted and the bottom to fill up, just a meaningless stream between two glasses.
That effect did not look very convincing to me either. In a very AI way, there were some missed connections between the level of water in the top glass and the density of the stream going down. But I am sure there will be continuous improvements to AI languages which may one day iron that out. We're just not quite there yet.
@ULTRAOutdoorsman well, Hydrophobia was more like a water physics demo with a game attached to it; but damn was it impressive.
Amazing! I can foresee photorealistic NPCs generated in real-time! (LLM + Audio2Emotion + DeepFake mask on top)!
Gaijin splatting or Gaussian splatting? 🤭
when does hey say gay-jin?
I think for sure he means Gaussian splatting, but got derailed by name of gaming company ;)
@@umadbro4493 I was so confused when I was checking the timestamps. LOL
give it 5 years and it will be practical in an actual game, maybe can be used today for slow paced narrative adventures, but the backlash is huge, I prefer games made mostly by humans but I dont think the market really cares, if companies can make it without legal troubles it could crash the entire industry
The problem with trying to achieve this super realism is that it goes straight into uncanny valley if everything else isn't up to par. Let's say you do get an entire game with gaussian splatting, it's realistic as hell. But the animations are still video gamey and quick like CoD. It's not gonna look good. It will just be weird. Just like watching a movie in 60fps. It's not good.
I think this is going to change the industry the same way raytracing did.
For video games AI generated physics would be fine, but it's just creating a convincing emulation of how physics work. Kind of like how you show a ball rolling into a tube to a kid and they know it'll come out the other side without needing a mathematics degree.
Video games ARE convincing emulations of how physics work.
8
Whoosh!
@@James-dc6ft Yes but not in the same sense. There is a difference between deterministic and non-deterministic.
In classical games the output of any action can be predicted because the variables of the physics environment are predetermined. This is not the case with AI generated anything, it is non-deterministic meaning you can not reliably predict the output or logic flow of the actions you make in the game.
AI simulations (in their current state) could not be used to calculate complex simulations without knowing the end result. Deterministic systems work based from the root up, it's chronological in nature. Non deterministic is not, it starts wherever it needs and makes up results to fill the blanks to get to the next place it needs to be.
Think about it this way, in a deterministic system like our universe you can predict how far you need to jump to get across a gap. In a non deterministic system you cannot because any variable can change at any time. You could put the same amount of force into each jump and go a different distance each time, the gap would also change lengths arbitrarily. In fact, if you look up AI Minecraft you would see what it would be like (that's assuming the non determinism adheres to only your perspective and is not influenced by others). It does actually do a pretty good job of representing an non-Euclidian space.
@@hogandromgool2062 AI is good for things it has been trained on. Where it breaks is when you try to do something outside of trained scope. If you use the same inputs in AI model, it will generate the same output every time. So it can work as a algorithm. Problem with games and videos is the input is always changing.
the best product possible is the one that makes the most money for the capital owner
5:00 Nope, it does not have to have any understanding of actual physics. It can be a variable shader, modulated by AI, written by someone who knows that volume and gravity exists, and writing a shader for it. Do you think that glasses with liquids in video games have generally been fluid simulations before?
5:16 This is not really true. It'll give you an approximation of what happens with the result of physics. But its not simulating nor understanding it. It's a statistical aproximation of fire from videos, and water pouring. Already in a ton of your examples with them, you can see its not behaving like real physics, the water doesn't have surface tensions, the volume poured doesn't match up. .etc The problem here, is how did they train and with whoose data did they train for those effects? did they do it illegally using vfx libraries without the consent of the owners? These are the real questions tbh, because I'm not going to use an effect that screws over an adjacent artist' work, like vfx. This AI shader stuff is gimmicky, because when everyone starts to use it for lack of art direction and just for the ease of it, its going to make everyone's games look and feel the same. There's a reason a ton of games with clear art direction look amazing to this day, even from 20 years ago. Even then, it really is just a gimmick, and you can't copyright those parts of your game anyways. Idk why anyone thinks that using an instagram filter on your game is a great idea tbh, because its not in the long run. Nor is it ethical considering these generative models are made screwing over millions/billions of creators and regular people, devaluing their work and saying they own it all.
Creating a statistical approximation of physics is simulating physics.
@@Bluedrake42 True, just not a good method to go about it.
@@johngddr5288 If you close your eyes and imagine a fire... your brain is doing a statistical approximation of fire physics. If that's "not simulating nor understanding it", are you claiming that human brains don't simulate physics (through imagination) nor understand it, either?
8
@@IceMetalPunk You misunderstand.
Your brain is not simulating in the same sense a computer does. you cannot summon the exact image every time, it shifts slightly I will explain a little bit of why.
Your subconscious acts very much like an LLM. It's locked in a dark box with no concept of smell, touch, taste, sight, hearing or physical sensations. What your subconscious knows is what your conscious tells it or plays to it while you're asleep. This is why summoning photorealistic images is thought to be impossible. This is because your subconscious mind (The thing connected to your minds eye and emotions) it doing exactly what you say, approximating physics and stimulus. You might have noticed that these approximations suck, quite so. They are never accurate, nothing tastes right if; it does at all, pain doesn't really exist in dreams nor does most stimuli and the visual landscape is not correct and shift wildly, The whole time your brain will not realize there is anything wrong with said simulation. Did know the human brain can not create faces it has never seen? If you see a face in your dream, you have seen the face before.
The difference is determinism. Your dreams are fractured and skewed along with your minds eye because your subconscious has never actually experienced any of the variables of the real world first hand. They can change at any time. This is the Same for Current Ai deployments. They don't actually have set variables so things can change at any point. This means that while yes they can make really nice graphics and relatively reliable physics this can only be produced for known outcomes. A real simulation can simulate unknown outcomes accurately because that is the idea of a true simulation in the scientific sense is as close of a approximation as possible. This simulation should also yield repeatable, predictable results.
We're probably going to want particle-based control layers for repeatable liquid flows and explosions, because they're so chaotic. Also, as a 2.5D effect the hand thing works, but I doubt you could bullet-time pause to orbit it, and maintain stability of the "sim" without training on oogabytes of VDB.
In a game that uses AI that guesses what things should look like , could we have a replay system so we can go back and see where it got things wrong, mark them to help it learn?
i still feel like there is a difference between a visual simulation and a proper physics simulation. like the program you're using doesn't necessarily understand how physics works, but it does understand how things generally are expected to look so it creates the mirage of physics sim without actually simulating physics. it's "just" procedurally generated visuals, and damn impressive ones at that
You wrote "Gaijin Splatting", but I think you meant to say (and write in the YT Chapters) "Gaussian Splatting"
Sounds like a good War Thunder term 😂
I'm pretty sure the subtitles as well as the chapters are generated automatically by TH-cam. I really doubt he doesn't know it's "Gaussian".
The problem is made pretty clear by the pouring water example you showed: the AI could not remain consistent in terms of which part of the glass had water and which had air, leading to each glass having two different "parts" to their water. Whenever a glare created a division in the 2D shape of the glass, the AI treated each side of the division as a separate container and "filled" them independently.
It's a video picture soup. It's trained on real video.. The real video is realistic because it's based on the real things in video... Sorry man it doesn't know physics.
its probably more efficient to diffuse the visual than to compute the physics... for the same level of quality (especially in a year from now)
09:05 - I'm pretty sure AAA studios will mess it up - it will get monopolised and tied up into monetization because of their pure greed and treatment of the consumers , I feel the way this will take off will be through the modding community. The targeted affects as a mod that focuses on only what the user/author aims it at would be insane. I'm hooked , Thank you for showing this tech off ,As a veteran gamer I would use it on all the classic games I've played over the last 30 years, I would also use this for PCVR games :)
they can't. AI is open source. 🤷♀
Nope this is a shake up tech id search for companies coming up using this to invest in
Gaussian splatting does not solve the issue of materials and how they behave under changing lighting conditions. There are already AI papers where they estimate material properties, but it's a bit further out until assets like the ones shown in the video also look good in a night scene, or when it rains and certain surfaces should be reflecting light more etc.
@ULTRAOutdoorsman Aka recording a movie? :D
6:00 I tapped out here, this is just the dumbest thing I've heard. AI isn't learning physics. It's picking up the ability to render videos based on other videos of physicsy things happening, just well enough that if you haven't got particularly high standards, you'll be fooled by it.
Yeah same, this is when I decided I’m not watching another bluedrake video lmfao.
This is the sort of stuff I want such tech to be used for. Imagine playing old games with greatly enhanced graphics.
Pretty sure it's called Gaussian Splatting and not Gaijin Splatting or is this a new technique that i haven't heard of yet?
One of the very first things I noticed and became in awe with those first AI video generations was actually the physics simulation. Yeah we can see a lot of artifacts and incorrect things and weird stuff like multiple hands and fingers and facial expressions being all weird and uncanny, but something that still amazes me in those sketchy AI videos is the precision they have when trying to create physics, be it light (like surface scattering, color bleeding, shadows, reflections, caustics) or be it physical stuff (like wind, objects being pushed, water flowing, dirt and mud being displaced). And all those stuff get completely overviewed or simply ignored because we're all focused on having fun, not the tech itself.
So it's actually really cool to see this kind of video realizing the same stuff I did, and I really hope that the professionals in the area are also realizing this too, in order for them to make use of it and actually keep improving.
I'm a game dev, and believe that this is just another flash in the pan. Hyperrealism is visually satisfying (to me as well), but the game you ACTUALLY like to play are heavily weighted toward player agency. So, for my game I have *intentionally* gone with a non realistic style. Not pixel or stylized, something immersive, but something that I can mostly managed using shaders and a clipped palette. This is to ensure that time is spent on player actions and IMHO more importantly, a game AI that is at least human equivalent.
Hyperrealism IMHO is a commitment to a bottomless pit of effort that almost never ends up with 'fun' as the focus.
But, yes, it'll change games forever by driving a wider wedge between hyperrealistic arms race AAA's engage in and game you actually enjoy playing.
Here is a thought experiment. Imagine a chess board and your adversary, a human expert. You have pieces which are either static meshes, or you have complicated IK based attack\kill sequences that are fully realistic. Did the visual effort improve or detract from your enjoyment of the game? I imagine the only people who would enjoy the graphics would be the people who don't like playing chess competitively.
Next look at a pro level esports player. Look at their visual options. Ugly for maximum function.
These points do not mean that ugly = better, but it does help define the experience of the player. If you want a play a game where you ooh and ahh at the graphics, chances are that ooh\ahh happens only at the beginning, maybe once. After that gameplay is what maters.
So, if you built a priority table, 'realistic graphics' is only important enough to not create an immersion break. For that, you need consistency, not realism.
Lastly, for work I'm an AI professional. LLM's or statistical models are just probability datasets. No physics needed at all, it's entirely 2d and frankly I think the integrations in to a 3d renderspace will very much narrow the scope for this kind of tech. Try getting that water to flow behind a volumetric fog from a prior smoke bomb in the scene.
even basic post-processing injectors like ReShade can access depth buffers and handle complex layering of effects with proper occlusion. Any AI system integrated into a game engine would have access to far more - full scene graphs, material properties, object masks, multiple render passes, and physics states. Your water/fog example actually demonstrates this misunderstanding, proper depth-aware post-processing has been solving exactly these kinds of layering challenges for years.
Interesting…
@@tyronejohnson409 I understand your point and it's valid, however an engine integrated solution that has access to this data isn't a enormous step from where we are and is a very heavy way to accomplish the same thing, albeit with the modularity to sub in other effects.
I'll admit that I was looking at this from a purely post processing perspective... which is what we were shown. I do have a bit of experience in wrangling in the debt buffer space when writing a Laplacian filter for and edge highlighting, though I'll admit I have not used it for much other than that.
My general point however is that the graphics arms race is moving at a rapid pace, while the discussion in for example r/GameDesign show that most devs are struggling with elemental mechanics of how to make the game rewarding\fun\satisfying.
I not even a lover of non realistic, I'm just saying that the overinvestment in this area of game development is solving problems that hardly anyone benefits from and if anything graphical fidelity is being used as a replacement for actual innovation.
I been so focused on the possibilities of narrative and NPC AI, I hadnt considered the graphical potential.
No, the models don't _understand_ physics. Which is kinda the point. They don't have to.
The calculated outcomes are very close to actual physics simulations with vastly less compute resources needed to do so.
But because of that, they are also easy to break.
As you have shown. It can pour liquid into another glass, but it doesn't _move_ the liquid (yet) it copies it.
Would love to see these techniques implemented in Flight simulation. So much potential for generating ground clutter, objects and photogrammetry. Exciting stuff man.
10:42 Japanese love Gaijin splatting. (Chapter name)
I think that A.I. generated effects could be used in junction with traditional 3D graphics, so that they majority of the graphics being rendered would be stable hand-made assets but then effects like smoke, fog, water or even lighting could be included via A.I.
I love that you are pushing img2img tech to realtime gaming. Intel did it 3 years ago with GTA but nobody did anything after that. ❤
Fantastic things. What about creating the next phase-having AI shape something like the Unreal Engine to make an ultra-realistic image, while another AI tries to guess what's real and what's artificial? Like a competition-a cat-and-mouse game
That Starfield facial animation is a great exmaple of garbage-in, garbage-out. Even AI can't fix those cursed animations.
The key thing with AI is that AI allows to do certain things at a bigger picture. Like for ex. photo and video restoration tools were just algorythms to caculate inconsistencys and such for a specific task. While AI can do the same thing but respecting the input you give. Like with newest best so far SUPIR Image restoration tool, you can give the image input a prompt to say what it is with your background knowledge and based on that it's able to know the background story to restore an image. While otherwise you would just let the AI guess or the algorythm just do it's general thing instead of understanding the concept of an image.
With these Photogrammetry type scans, it will be the same, as it will then understand a surface based on the reflection fresnel that shows troughout the whole image set, not just the single image to recreate the surface. Like when i look at a glossy surface, i can guess in my mind by seeing it what that surface should look like if it would be just matt. Which is a huge problem with the old Photogrammetry and a lot of 3d scanners. The same as with low quality phone pictures where it doesn't need that super tiny dot in the texture to tell if the surface is round or flat. It understands the bigger picture as we humans can.
I think there's a typo, it should be "Gaussian splatting", like in "Gauss" - the mathematician
I was wondering if "Gaijin splatting" was a new method, until i heard him try to say "Gaussian".
@liquidcobalt lol
Automaticly thought of Spiderman games. You could be able to see an infinity of scenes watching through those windows
I don't think it knows anything about physics, it's only trained on images and video. If the AI was trained to make fire in a 3d simulation then yeah obviously. But it's just seen a billion pictures and videos of fire. And in that media it's seen how the fire behaves so it just copies it. Imo anyway. It's an ideresting idea to think about. Either way it's cool af.
well isn't a simulation copying the behaviour of what you are simulating ? So it definitely has to be somewhere in between the two ideas at least.
It knows a distilled form of the physics required to generate images realistically because it is copying real physics from real images. It can probably make some really good guesses most of the time, and fuck up terribly the rest.
@@georgepal9154 its just images, that's not that hard to understand
I dunno. Machine learning models can take a few 2D photos and create a full 3D scene/models out of that. With that in mind, what's the difference between "copying what fire looks like and behaves like from any angle" and simulating fire?
@@IceMetalPunk the simulation mimics the exact characteristics of something while ai just copy's the visual aspect of that thing.
I come from the 80's and 90's, when you were in a smoke-filled bowling alley arcade playing mk 1 and street fighter 2 side by side with you fellow opponents. If I would have seen any game today then, I wouldn't have been able to play those games because the realism I saw would now break my emersion in the world of the game I'm now playing in the 90's. The games have progressed so far, it's crazy, and I see this photo realism being the next stage. It's going to be mind-blowing if you are a nerd gamer who loves to get into these worlds. It's a playground for us all, it's going to be amazing.
03:11 why would anyone make that mistake. it doesn't even look real at all, the water layer changes and becomes more full on the on the water is pouring out of, then you suddenly have 2 layers of water in one glass, the one that gets water poured into it doesn't even react to the water "entering" the glass. There are also other artifacts and issues.
This is something that reminds of the tecnique used in the Siren series. But with todays tech it can become into the greatest path videogames can go, cuz 3d models will never be realistic enough.
Screw the realism.
Everything is becoming soulless.
I think in the future games can be toned down a lot graphics-wise but optimized to work with AI post-processing to result in overall photorealistic gameplay with minimal impact to performance.
I believe you are wrong. here is why, a talented artist can paint flame animations. But can't simulate the physics of it. Ai is just replicating this very fast.
Gaussian splats are so cool. They even capture reflections, so as you move the viewpoint the reflections change as you would expect. It's not just a texture on a model, but how the light is actually interacting with the camera.
Why did you write "Gaijin" instead of "gaussian" in your timestamps?
they might be AI generated.
That's what happens when you mispronounce gaussian as "gawzyn" instead of "gow-zee-inn".
@@Lowraith Glad I'm not the only one thrown off by that.
Interesting that the Byte Dances in Cyberpunk 2077 as the same thing as the Gaussian Splatting algorithm. That you can walk around and record every detail of the environment, and then analyse it later for hidden details
No, it doesn’t have to understand anything or simulate anything. It is mimicking it. And yes there is a difference.
Also, it does just good enough to fool another finite neural network. The brain.
Can’t wait for this coming to vr games
Until I can not make an AI clone who goes to work for me while I'm watching movies from my sofa, I'm not in.
That would be the finale. Lol... But you'll have to buy it of course. Like a car.
@@markchristantaguiam819 you would still have work as your AI customer service with company that sold it to you as intermediary. Boss would complain if your AI assistant isn't doing a job properly
At least we have AI watching movies (every single video on the internet) while *we* go to work
@@thomashewitt8104how can I do that?
The visual simulation is all very neat. But what I am most excited about AI doing to games is having NPCs who actually think and adapt to my actions. Applied to a game like Skyrim, Lydia would be aware of her own backstory, she would remember things that I have done or told her, she would generate unique dialogue based on what she has learned about me, etc. Nazeem would know that I do, in fact, get to the Cloud District quite often, and Jarl Balgruuf and I are friends. Imagine if all the guards, shopkeepers, random travellers and bandits no longer used prerecorded, repetitive dialogue, but create their own unique dialogue and no two NPC voices are the same because they are generated by AI to match each character. I predict Bethesda will do yet another re-release of Skyrim with AI enhancements both visual and character-behavior-related. And I will drop another $80 to have it.
It's gaussian splatting not gaijin splatting. A little correction.
he said gaussian, or that's what i always understood without knowing it existed
@@umadbro4493 The chapter name says "Gaijin". Idk if it's meant to be a joke, his pronounciation is a bit funny 😁
8
@@umadbro4493
Gaussian is pronounced "gow-zee-inn" or "gow-see-inn". He said "gawjyn" about a hundred times.
All of this AI tech combined with VR will be the true future of gaming.
You talk too much, but too litle useful information. It is just repetitive mumbling of the Img2Img service you are trying to market.
I believe the next step for this technology is integrating it into the game engine, so it may take in real data and context from the game scene, where nothing is "smeary" or being guessed, and instead, all that context and data is already supplied to the AI. It would fix things like us only having limited context windows to generate with, it might fix visual artifacts and allow more realistic results. That way, we DON'T need physics models built in, and it just uses all the pre-setup calculations from the real in-game scene
Let's see!
I saw this tech before.
What im interested is the cost of this post-processing in real time for games, and streams (video).
For example AI filters for mincraft in realistic mode, traditional painting, or digital painting, I think it's cool if we can make it in real-time. (but yeah it's too much to consider for latency for games)
I wish more studios would focus less on realism (which never looks good anyway) and more on creative ways to make games more engaging. This machine learning bs misses the entire point of why people play video games. I get that you're impressed by some of the effects, but you are lying to yourself if you think it actually looks good, realistic, or think it will improve any aspect of video games, let alone be the future of it. The hubris.
Skip improving regular games, help hump the hurdle keeping VR games from looking good due to processing restrictions. So, could this process work in VR?
Real artists and real gamers dont want any of this cancer. Be better.
Its easy to imagine old games being re-rendered as totally photorealistic using AI, in the very near future.
Hell, I guess thats obvious at this point
AI video models are trained off real world footage, and the real world runs on perfect, flawless physics. AI doesn’t understand physics any more than a video camera does.
Yes I'm noticing a few people in the comments being confused between functionally accurate and functionally applicable.
AI simulation is non-deterministic whereas a "Real" simulation that yields usable scientific results is deterministic.
There's a few people very angry in the comments with this misconception, including the video creator. Bless their wee souls; I've worked with AI for near 5 years now and I once had AI Fever too, it was actually until quite recently I wasn't sure they were to some degree sentient but I decided to drop that as it's been obvious to me for a while they're not. I just wanted it to be true.
If a monkey can use a tool by repeating what he saw another monkey do, I'm gonna consider the monkey can use a tool.
This is indeed very interesting. I see your point completely. Suddenly, seems like we have stumbled with technologies (LLMs, NN, etc.) that have the impact to create "emergent behavior", to put it in some words... We are still to understand the full impact. That's why there's a lot of overhype and expectations around this.
@Bluedrake42 it's just one step closer to a startrek holodeck
AND I CANT HANDLE WAITING FOR IT,
IM SO EXCITED FOR IT
Guys… if you all are gonna say “it doesn’t understand physics” but then say “it just watched hundreds of thousands of videos where it learned how to predict the behavior of physical objects from the content that it watched” then I don’t know how to have a rational conversation with you
Well, if you train a learning model on cannon ball flights, telling it truth repeatedly on how far the ball will go when fired at 45 degrees with a variety of initial velocities it will learn that and be able to predict how far the ball will go when fired at 45 degrees at a given initial velocity. That's just learning and interpolation. But that won't mean it will know how to predict how far it will go when the canon is tilted to 70 degrees. For that you need to do the actual physics. Now, an AI engine can be taught to refer to the physics, but it isn't necessary for it to do so when just predicting within the ranges of the data sets used to teach it.
From your video, it clearly learned what fluid 'looks' like when poured from one glass to another. But it also clearly didn't learn that mass is conserved.
Difference between imitating something and understanding it.
The machine has no concept of "understanding" anything. It is just a human word that is used for it. You know nothing about this topic and brag about it like a professor. You are not even a beginner at this point. Go read and read more.
It didn't learn to predict the behavior of physical objects. It only has data of pixel patterns that humans assigned words like water to. You give machine learning much more credit than it deserves. Also software that is doing physics simulation doesn't understand physics. It is just calculating the math that someone who knows the physics put into the system. Because this is all Computers are, really fancy Calculators.
AI understands fluid dynamics as well as the average person intuitively understands it, because that's all it needs to convince. So how well does the average person understand it? I suspect not very well, because there was never much evolutionary pressure to understand it in detail. And in fact the AI can't even match that - those pouring glass simulations are pretty nonsensical when you look at them for a few seconds.
So while I think it's heading in the direction of understanding, I think the gulf between real physics and AIs internal simulations is currently huge.
The most impressive video/technique I've seen of AI being applied to enhance realism is the GTA video from ISL and collaborators. They pass on the g-buffers on to tensors of the AI model where each pixel on the screen is given an ID. It is very stable and convincing. The point with that technique is that you wouldn't need to prerender to a high standard at all before it is passed on to an AI model as a post processor.
No. AI doesn’t understand physics anymore than an artist, or more specifically, an animator. …at least in this context. An animator would know for their scene the ball hits the glass, the glass breaks and it shatters the glass following gravity based on the scene. So AI would know that too I’d think. It can just redraw it again so much faster for when the scene changes or is altered. So rather than being truly dynamic physics, it’s more like applied physics really really fast.
This is groundbreaking! How has nobody covered this before?! Amazing stuff. Thank you so much. My only question would be the processor overhead for something this sophisticated.
You mistake making a physics simulation, with making an illusion of physics simulation. We got so much hung up on 3D and materialistic viewpoint, that we forgot that we see reality only in 2 dimensions. And that's it. All our perceptions are illusions, so A.I. becomes just a master of illusions. Every breakthrough in game graphics was a break in making a better illusion, and a shortcut, NOT trying to actually mimic matter. Imagine creating a wood material not with textures but with atoms... good luck...
His point is that despite being an illusion, it's bringing in real information in order to meaningfully replicate the effect. We see the same thing with chatGPT. These systems are challenging the ontology we've developed in the West since the rise of modernism. That's why this is a big deal
@@Cherem777 saying "real information" is a mistake, information is not real by definition, information falls under cathegory of abstraction.
Guys, I know our education has failed us miserably, but we need to correct this mistake. We need to stop getting lost in elementary semantics, or we will just go insane.
I hope I am not misunderstood, I dont want to diminish the value of the discovery by any means, I want to point out those seemingly small details, because I think they are very actually very important, and understanding them might bring a better view and insight into the matter. The topic is pretty difficult already, we needn't complicate it more than necessary.
@@linuxrant
I think you'll have to break that down a bit, because I think we're starting to see that information has a reality of it's own in the sense of Plato's forms. The idea of information being something that (as the etymology of the word suggests) provides form to something. I'm not completely denying Aristotle because I know objects can participate in reality and produce their own formal causes. I just think we need to leave this substance-only ontology behind and that's what I'm getting at. I truly think information is more than abstraction.
This picture-to-AI-model tech is basically what Deckard was using in Blade Runner to find clues in photographs. This is AMAZING.
As an artist. My issue with Ai is it’s ripping off existing art. So Minecraft not running on Minecraft. Becomes a spotty grey area where we are ripping off entire engines instead of just art.
Exactly. Every one of these 'super realistic' ai filters of people he showed, was surely trained on images and videos scraped from online without consent.
Non-AI real human artists do the same thing these systems do: They synthesize sources of imagery based on their experience (dataset) to create something new. Therefore, no one is being "ripped off".
That is not how AI works; it doesn't rip off existing art, no more so than you would by touring an art museum. AI art generators are trained on vast datasets containing millions of images and their descriptions. This training allows the AI to learn patterns, styles, and aesthetics from existing works. However, the AI does not memorize or store these images; instead, it learns general characteristics. When a user inputs a text prompt, the AI uses its training to generate a new image based on that description. It synthesizes elements learned during training to create something original rather than copying any specific artwork.
@@EclecticSundries where did those images its trained on come from? AI did not pluck them from its ass. It cannot make ANYTHING original, only ape what is fed in. its an algorythm which apes art styles, drawing styles colour styles. Someone made them. ... who made the styles? artists. You may be someone who thinks that but unfortunately many people feed these ai algorithms with other peoples work without permission. That is fact. Artstation tried it to much kickback. Adobe can get away with it as it owns millions and million of stock art and photography collated over the years which someone made to be used open source, free. but it was still created by someone. Its not magic.
@ AI doesn’t simply copy or ‘ape’ existing work-it learns patterns, structures, and techniques from data, similar to how humans learn by studying art or styles. It generates new outputs by combining and applying these learned concepts in novel ways, rather than reproducing exact replicas.
As for the training data, it’s arguably used under the principles of fair use, as the purpose is transformative-it enables the creation of entirely new works rather than reproducing or competing directly with the originals. This is still a legal gray area, but it’s important to note that AI training aims to innovate, not plagiarize.
but one reason why these models look so good at the end of the video is due to light baking. The light of the real world is baked into the models.
If ai, in general, can just get consistency right, then mind blowing things can truly begin.
It's kinda interesting right now ai animation is like early 2000s special effects level and that it'll just keep building up from there until it looks real. I told my bestie and my brother that I feel in another year or two, we might have good enough ai that we could possibly just create our own games and have endless content with ai custom to what we want.
I've actually seen gaijin splatting in vr chat. Me and my bestie would just randomly come across a world that looked kinda realistic but on closer inspection it looked like it was foam just pieced together lol. Can't wait for that to get better too.
11:06 - Gaussian splatting. "Gow-zee-inn".
Named for physicist and geodesist Carl Friederich Gauss.
Same guy they named the magnetic process of degaussing "dee-gow-sing", for those who remember CRT monitors and that button you could push to make the whole screen EMP and reset the image.
Gaijin ("guy-djinn") is Japanese for "foreigner".
Oh yeah, I'm definitely excited for games to get prettier, just to have less substance behind them.
I love looking pretty at things that don't add anything
I read an experiment from about a year ago where they fed video of basic physics interactions in real life to an AI to see how many properties it could identify. It identified more than they were expecting for the scenes trained
Imagine a game where you decide what you do, like infinite everything. No code limitations ,if you want to craft something you just can. Everyone would have their own personalised videogame /movies , the potential omg