When Optimisations Work, But for the Wrong Reasons

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 พ.ย. 2024

ความคิดเห็น • 1K

  • @simondev758
    @simondev758  9 หลายเดือนก่อน +291

    Patrons can now vote for the next video! Thank you for your support. Also, some additional links from the video:
    ❤Patreon: www.patreon.com/simondevyt
    😍Courses: simondev.io
    WolfFire Games Article: blog.wolfire.com/2010/10/Imposters

    • @shannenmr
      @shannenmr 9 หลายเดือนก่อน +6

      Another good video that talks about this is "Rasterization, Overshading, and the GBuffer" under the "An In-Depth look at Real-Time Rendering" Series on the Unreal Learning Centre.

    • @uncletrashero
      @uncletrashero 9 หลายเดือนก่อน +2

      of course none of it matters now because we got Nanite :O

    • @lazygenie5616
      @lazygenie5616 9 หลายเดือนก่อน +4

      Ok so not to be an ass but you used footage from Wolffire games website and it would be cool if you could link them in your description. They are a fantastic developer and deserve more recognition

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +15

      @@lazygenie5616 Oops, I clearly wanted to credit everybody since I added their links in the video itself, and tried to include them all in the description.
      I've edited the pinned comment to include anything missing.

    • @jjeeqq
      @jjeeqq 9 หลายเดือนก่อน

      Should it be better if we had hexagonial displays instead of square pixels?

  • @FrozenDozer
    @FrozenDozer 9 หลายเดือนก่อน +4128

    This is the stuff about gamedev that almost nobody talks about, and if they do, it's way too indepth. I love channels like this. You're doing a lot for making people understand how complex and smart Game Engines and Render Tech actually are.

    • @aeoliun
      @aeoliun 9 หลายเดือนก่อน +119

      Hey guys welcome to my game tutorial. We're going to make a complete game from scratch.
      2 episodes. Last video was rendering a jpg to the screen. Last upload 3 years ago.
      Every time.

    • @smokinglife8980
      @smokinglife8980 9 หลายเดือนก่อน

      ​@@aeoliunyou should not need a tutorial to make a game from scratch I'm not the best or most knowledgeable coder but I am making my own game engine using c++ and I mainly use c# but I'm getting it done

    • @w花b
      @w花b 9 หลายเดือนก่อน +56

      ​@@aeoliunhonestly I kinda get them. Very few people will watch it in comparison to a much faster tutorial on an already built game engine. Imagine the dedication and free time required to make such a long thing... If doing it alone seems awfully long, imagine having to record AND edit that video (potentially)

    • @youtubelisk
      @youtubelisk 9 หลายเดือนก่อน +4

      What do you do with this knowledge?

    • @justseffstuff3308
      @justseffstuff3308 9 หลายเดือนก่อน +18

      @@youtubeliskHave fun knowing this :)
      Knowledge is its own end, imo. It's just fun to learn, when the education is properly executed.
      Might as well ask what people will do with their time playing video games- except this has a small chance of being slightly more practical

  • @almachizit3207
    @almachizit3207 9 หลายเดือนก่อน +912

    This also really helps explain how 4k gaming is possible on these GPUs. In terms of GPU usage efficiency, smaller pixels is effectively the same as having larger triangles. So while 4k screens have 4x as many pixels, you're also throwing away far less work that the GPU is doing, which helps regain some of the performance loss

    • @vanqy.
      @vanqy. 8 หลายเดือนก่อน +38

      im wondering if eventually stuff like nanite gets integrated on a software level for gpu drivers and the architecture architects just go yolo and increase 2x2 quads to 3x3 nines or 4x4 sixteens. or a mosaic pattern of pixel groups that mimics and has statistically highest coverage with most common triangles, so that less culling is in place.

    • @vyor8837
      @vyor8837 8 หลายเดือนก่อน +26

      ​@@vanqy. You can't put that type of thing on the driver level, maybe the API level.

    • @MustacheMerlin
      @MustacheMerlin 7 หลายเดือนก่อน +25

      Conversely it also explains why using technology like DLSS to render at ever smaller and smaller resolutions doesn't improve the performance nearly as much as you'd expect. Like, you'd think rendering at 480p should be orders of magnitude faster than rendering at 4k, since there's orders of magnitude less pixels, but it's only a little faster.

    • @bricaaron3978
      @bricaaron3978 6 หลายเดือนก่อน +24

      @@MustacheMerlin *"Like, you'd think rendering at 480p should be orders of magnitude faster than rendering at 4k, since there's orders of magnitude less pixels, but it's only a little faster."*
      Errr.... Rendering at 640 x 480 IS a buttload faster than rendering at 3840 x 2160. Just as one would expect.
      DLSS involves a whole lot of processing, just like AA or anything else. And so --- just as one would expect --- it doesn't deliver the same performance as actually rendering at a given nominal resolution.

    • @artosbear
      @artosbear 5 หลายเดือนก่อน +6

      Nanite rendering is absolutely destroying the performance in Remnant II, along with "smooth framerate" I know positively because using a mod to disable them increased performance so much I turned off upscalers, at 4k. So Ive been looking into nanite trying to understand why this tool for making and running high levels of details could also destroy performance, for little fidelity benefit

  • @freelunch1458
    @freelunch1458 9 หลายเดือนก่อน +525

    “I’ve always been interested in optimizations” man I wish this was a requirement to work for any big game company 😭

    • @Loggus66
      @Loggus66 2 หลายเดือนก่อน +13

      It is, they optimize their expensive dev time spent so your PC can be utilized better.

    • @esmolol4091
      @esmolol4091 หลายเดือนก่อน +5

      Dev might want it to be optimal, but the company suits at the top just care about fast releases and pleasing shareholders, which puts optimization at the rock bottom on the priority list.

    • @rianp1300
      @rianp1300 26 วันที่ผ่านมา +1

      The problem is not only the developers, but the budget and the deadline

  • @tux1468
    @tux1468 9 หลายเดือนก่อน +1911

    thank you so much for saying "impostor" at least a dozen times while not making a single among us reference, im proud of you

    • @agushernandezquiroga9064
      @agushernandezquiroga9064 9 หลายเดือนก่อน +421

      I didn't even think of among us when watching the video, my brain must be healing.

    • @ps-dh8ef
      @ps-dh8ef 9 หลายเดือนก่อน +119

      @@agushernandezquiroga9064 agreed. I think i am healing too

    • @prometheus9732
      @prometheus9732 9 หลายเดือนก่อน +47

      I haven’t thought about among us for 1 years.

    • @FabulousJejmaze
      @FabulousJejmaze 9 หลายเดือนก่อน +82

      GET OUT OF MY HEAD GET OUT OF MY HEAD GET OUT OF MY HEAD 📮🔪

    • @cyberneticsquid
      @cyberneticsquid 9 หลายเดือนก่อน +3

      oh hi tux

  • @kittenmittenkitten
    @kittenmittenkitten 9 หลายเดือนก่อน +1487

    I like that Epic's approach to finding out that subpixel polygons kill render times isn't to tell artists to avoid intricate geometric detail and to make more LoDs, it's to double down and make a system that lets the artist go absolutely wild with detail and not need to even think about LoDs.

    • @djmips
      @djmips 9 หลายเดือนก่อน +289

      This was incredibly challenging and I think it was the end result of a decade of research.

    • @mrpojsomnoj3313
      @mrpojsomnoj3313 9 หลายเดือนก่อน +78

      @@djmips Decade to find bottle neck. Some month to fix one.

    • @DreadKyller
      @DreadKyller 9 หลายเดือนก่อน +508

      There is one downside to it however, mostly from inexperienced devs working with Nanite, people are using Nanite as excuse to have extremely high detailed models in the game, even for things the player will never be close enough to notice. While Nanite alleviates many issues with the performance of such assets, it leads to far, far larger game sizes as many people just throw high detail photogrammetry scans into the game. Just because Nanite can handle automatic LOD doesn't mean no effort should go into optimizing the base mesh still, just that you don't need to author LODs manually, the base model still shouldn't be far more detailed than it needs to be, we already have 300GB games now, we don't need 500GB games because people throw in hundreds of 5-10GB models.

    • @SimonBuchanNz
      @SimonBuchanNz 9 หลายเดือนก่อน +115

      ​@@DreadKyllerNanite also prebakes and quantizes meshes, so there's really no excuse as it's basically moving a slider.

    • @woobilicious.
      @woobilicious. 9 หลายเดือนก่อน +18

      @@DreadKyller GTA V is over 10 years old, and 100GB in size, games should be 1TB in size by now...too bad Storage tech hasn't kept up.

  • @matts.1352
    @matts.1352 9 หลายเดือนก่อน +577

    One LOD technique I like a lot especially in the mobile world where nanite-like LOD engines currently aren't feasible is Progressively Ordered Primitives/POP buffers. The core idea is to cluster vertices through quantization at different levels of precision and sort them such that lower-precision vertices are first in the vertex list and higher-precision ones are last. The end-result is you can change the LOD of a model just by changing the quantization level and how many vertices you choose to draw without storing any more data than original mesh used.
    The benefit is four-fold:
    - Artists only need to make one model with any arbitrary attributes
    - Cracks/seams can be handled perfectly
    - Can have dynamically adjustable LOD levels without popping from mesh swaps
    - Can stream in the extra vertices as they're needed or upload the whole vertex buffer once and instance multiple LOD levels in one draw call. The latter is useful as draw calls are still disproportionately expensive on mobile.
    Since you're sorting the vertex list anyway (which can be done in O(n) time) and vertex order within quantization clusters doesn't matter much, you can also sort them on a secondary level to maximize vertex fetch efficiency which is important for mobile because of binning (more-so than overdraw since tile-based deferred renderers often have near-perfect hidden surface removal). The neat thing is you can scale the quantization level according to how large quantized grid would make triangles appear on screen to help maximize quad occupancy while maintaining enough detail for it to look good.
    It does have some drawbacks, notably that it doesn't play well with vertex skinning and lower LOD levels tend to have a bit more triangles than hand-made LODs, but it's great in a mobile environment or for procedural mesh LODs.
    As a side note, optimizing for mobile tile-based deferred-rendering is a lot of fun and it feels so much more rewarding to make a mobile engine run fast. Most mobile developers just port PC games or graphics techniques to mobile as-is and call it a day while limiting gameplay to negate poor performance; however, with careful optimization you can achieve between Xbox 360 and Xbox One levels of performance on most modern (>5 year old) mobile hardware. I'm definitely biased though as I've found my niche in mobile optimization.

    • @SimonBuchanNz
      @SimonBuchanNz 9 หลายเดือนก่อน +25

      That sounds amazingly cool! Can you say what you've worked on?
      (Sorry for going of on a tangent here, I'm just musing 😅)
      I've always found the capability of these mobile chips to be bizarrely good, so seeing stuff like full Resident Evil running on an iphone wasn't *that* shocking, but I've found it weird and a bit disappointing that despite this the market doesn't seem to care about it at all.
      The Steam Deck and Switch's popularity implies there are absolutely potential buyers, but why aren't they biting? It's far cheaper to get a game controller attachment than these devices after all, so it shouldn't be input. Is it just *relative* to the Candy Crushes of the world that they don't show up? Poor app store presentation? Investors not willing to back it?

    • @4.0.4
      @4.0.4 9 หลายเดือนก่อน +10

      Awesome, I didn't know that. As someone who hopes to someday make a mobile game that isn't hot garbage this gives me hope.

    • @crimson-foxtwitch2581
      @crimson-foxtwitch2581 9 หลายเดือนก่อน +41

      ⁠​⁠​⁠​⁠​⁠​⁠​⁠@@SimonBuchanNzThis is due to a couple of factors.
      1: The mobile market has been dominated by F2P monetization models since the mid-2010s. Paying even $5 for a fully-fledged game was a thing of the past a very long time ago, especially when you consider regions that the mobile market is at its most popular in: the whale-hunting model is especially effective.
      2: The Switch is a 7-year-old console powered by a 12-year-old chipset. High-end smartphones exceeded the Switch’s hardware capabilities years ago, especially Apple hardware.
      3: The iOS port of RE8 is exclusive to the highest-end versions of Apple’s highest-end smartphone, limiting potential audience significantly.
      4: Resident Evil 8 is built upon a game engine *designed* for modern scalability on top of being a game originally built for last-generation home consoles as-is, as opposed to now where the hardware difference between the average smartphone and a current-generation console is somewhere in the ballpark of 3000%.
      5. Newer console hardware and game engines has allowed developers to spend less and less time needing to optimize their assets, but it’s still faster and easier to build your game from-the-ground-up around the hardware you’re targeting anyway, especially if expensive graphical effects are integral to gameplay systems and your game’s art direction.

    • @comanderlucky656
      @comanderlucky656 9 หลายเดือนก่อน +38

      I like your funny words magic technology man

    • @thewhitefalcon8539
      @thewhitefalcon8539 9 หลายเดือนก่อน +2

      I don't see how that would work. You could order the vertex buffer, sure, but you'd still need entirely separate index data for each optimization level.

  • @techno_tuna
    @techno_tuna 9 หลายเดือนก่อน +474

    I am an absolute novice at game dev whos been toying around in Unity and now Godot for about two years, and I have to say each one of your videos feels like I should be paying you for this kind of info. The fact that this is your FREE content is insane, and I'm excited to see what your paid content looks like. You have a knack for beginning small, simple, and approachable, and then expanding to the point that I'm pausing and writing things down and yet still not feeling overwhelmed. I've read through documentation and white papers before for plenty of other coding subjects, but nothing has ever made me WANT to like your videos do.

    • @mikkelens
      @mikkelens 9 หลายเดือนก่อน +3

      If you want to get into paid resources worth their price(?) then I hear books on graphics programming are good.

    • @WrongButtonWB
      @WrongButtonWB 9 หลายเดือนก่อน +3

      @Techno tuna, but you are paying it, by watching it dude.

    • @maythesciencebewithyou
      @maythesciencebewithyou 9 หลายเดือนก่อน +2

      Nothing is stopping you from donating

    • @KiraSlith
      @KiraSlith 9 หลายเดือนก่อน +2

      He DOES have a Teachable if you want to buy full courses.

  • @perplexedon9834
    @perplexedon9834 9 หลายเดือนก่อน +187

    Ill confess, even up until about 10 minites in I has assumed, naively, that I did actually understand why billboards were so kuch better for performance, with thoughts along the line of "reading a precalculated viewing angle from a file/ram is MUCH cheaper than doing the matrix multiplication to calculate the physics perspective appearance of even a very low poly 3D mode".
    When you went through how the physically arcitexture of the GPU differently handles triangles with a size close to the pixel size, it was so mind-blowing. I haven't realised I was so starkly wrong about something in a while, such a great feeling!

    • @thewhitefalcon8539
      @thewhitefalcon8539 9 หลายเดือนก่อน +20

      That may have been the case a very long time ago. In 2016 my laptop still ran vertex shaders on the CPU. Intel GMA architecture - absolute garbage. Thankfully Intel stopped using that architecture and put real GPUs on their chips.

    • @naxzed_it
      @naxzed_it 7 หลายเดือนก่อน

      ​@@thewhitefalcon8539They stopped using that way before 2016

    • @rockoman100
      @rockoman100 2 หลายเดือนก่อน +1

      i dont understand

    • @TijmenHatesads
      @TijmenHatesads หลายเดือนก่อน +1

      ​@@thewhitefalcon8539that's such a nice way of saying "You might not have been wrong, maybe you're just poor."

  • @11nephilim
    @11nephilim 9 หลายเดือนก่อน +100

    Really enjoyed this! As an artist you get told what things to avoid - e.g. long thin triangles and extremely small triangles, but it's rare to get a good explainer on *why*.

  • @needleful
    @needleful 9 หลายเดือนก่อน +334

    14:30 the "maximum area" triangulation improving performance so much is news to me! I usually create slices since it looks "nicer", but I should probably think more about the final result.

    • @blarghblargh
      @blarghblargh 9 หลายเดือนก่อน +55

      optimize when you know your targets and know you're going to need it. you can't "optimize" everything. it's wasteful.
      if you can do "worst case" example scene and figure out how well it runs on the hardware you're targeting, then you might be able to get some inkling early of how to balance things out and spend your texture/triangle budgets.
      For the art itself, most people suggest trying to get even, quad-only topology. They don't tend to worry about triangulation performance. Stuff like UV density and how textures stretch when animated tend to dominate.
      For example, I wouldn't even bother trying to act on stuff like that "maximum area" example vs triangle fans, etc. That was more a tech demonstration to exacerbate a problem, rather than sound art advice. That all said, it can be good to avoid long, thin stuff when possible and to prefer workarounds, especially at lower LODs. For example maybe it's best at low LOD to have a flat quad with an alpha cutout texture instead of a lot of polygons that form boards on a bridge.
      And when making LODs, it can be good to zoom an object out to its max distance it will appear at that LOD, look in wireframe, and see how big polygons are compared to pixels they cover. That can give you some good ideas of how to spread out detail, and where to reduce detail.
      Or, if you're using UE5, enable nanite. Then you don't have to care :D

    • @thecat8411
      @thecat8411 9 หลายเดือนก่อน +27

      The maximum area is probably also the worst looking. Many rendering techniques look way better when the density of triangles is uniform along a model surface. And this is more important than small optimization.

    • @crimson-foxtwitch2581
      @crimson-foxtwitch2581 9 หลายเดือนก่อน +22

      @@blarghblarghActually, for the “maximum area” triangulation I found a method in Blender that makes these triangulations super easy to construct.
      Step 0: Bind Checker Deselect to a key of your choice(here I’ll use Mouse5)
      Step 1: Select the circle loop
      Step 2: Alternate Mouse5 and F until you’ve run out of smaller triangulations/hit an edge
      Step 3: Select all the overlapping faces and hit Delete, then “Only Faces”
      Step 4: Select the edge loop, it’ll automatically select every edge in there
      Congratulations, you now have an optimally-triangulated circle.

    • @blarghblargh
      @blarghblargh 9 หลายเดือนก่อน

      @@crimson-foxtwitch2581 nice one! thanks for the tip

    • @needleful
      @needleful 9 หลายเดือนก่อน +2

      @@thecat8411 I usually do the triangle fan out of habit, even on flat geometry or the bottoms of things. There are certainly cases where it'll look worse, but also cases where I did more work (usually extruding a default circular face and merging it to a point) for less than zero gain. Blender's default circle triangulation looks nearly the same as this "max area" algorithm.

  • @frendoman
    @frendoman 7 หลายเดือนก่อน +169

    "Mommy, Daddy, where do pixels come from?"
    SimonDev: "Sit down son"

    • @HupfderFloh
      @HupfderFloh 4 หลายเดือนก่อน +7

      See, when tree vertices love each other very much...

  • @Baltic_Dude
    @Baltic_Dude 9 หลายเดือนก่อน +111

    Now send this to Mortal Kombat

    • @danifurka6790
      @danifurka6790 6 หลายเดือนก่อน +4

      What do you mean? I don't get it

    • @garr_inc
      @garr_inc 5 หลายเดือนก่อน +14

      @@danifurka6790
      I assume this relates to how much downgrade models had to receive for the Nintendo Switch port of MK1.

    • @sagichdochned
      @sagichdochned 5 หลายเดือนก่อน +5

      Would say to Cloud Imperium Games for Star Citizen First :D

    • @PositionOffsetGaming
      @PositionOffsetGaming 3 หลายเดือนก่อน +1

      MK11, and to a lesser extent MK "1" (12), have some of the most impressive and advanced rendering in realtime graphics. MK11 looks almost as good as some UE5 games today despite being made on UE3. They're aware...

  • @krystofjakubek9376
    @krystofjakubek9376 9 หลายเดือนก่อน +103

    This is a very nice video! I have heard avoiding micro triangles is a big thing but now I finally know why

    • @func8211
      @func8211 9 หลายเดือนก่อน +4

      Nice profile pic

  • @Eckster
    @Eckster 9 หลายเดือนก่อน +108

    Wow, this really explains why Nanite was such a breakthrough, of course it doesn't dig into the difficulty of implementation, but it does show how it eliminates the excessive tiny triangle issue.

    • @lanchanoinguyen2914
      @lanchanoinguyen2914 9 หลายเดือนก่อน +5

      Raytracing will replace rasterization sometime and then nanite will have become obsolete.

    • @blarghblargh
      @blarghblargh 9 หลายเดือนก่อน +44

      @@lanchanoinguyen2914 we don't have a clear picture when that transition will happen, so a solution that solves problems now is still valuable, and understanding the limitations of those solutions is also valuable.
      it's like saying "eventually we'll have cheap consumer space travel, so who cares about light rail?". maybe not as extreme, since full scene tracing with no rasterization MIGHT be plausible within the next 10 years. but I kinda doubt it. not only does the hardware that supports such a thing have to come out, but everyone has to have bought it and phased out older hardware, too. unless you're saying current high end gaming hardware is already capable of doing zero rasterization and getting all the same level of effects. I am not sure I've heard anything that says that's true.

    • @valshaped
      @valshaped 9 หลายเดือนก่อน +48

      I doubt rasterization is going away any time soon, considering every GPU available today has a rasterization pipeline, and future GPUs have to keep a rasterization pipeline to maintain compatibility with
      - any web browser
      - any game on Steam (or GOG, or the Epic Games Store, etc.)
      - any commercial software
      Realtime raytracing is a neat addition to modern graphics cards, for sure, but it's not a silver bullet by any means.

    • @saniel2748
      @saniel2748 9 หลายเดือนก่อน +44


      1) We're extremely far from that happening
      2) It's not like raytracing is immune to triangle counts

    • @TheFunDimension
      @TheFunDimension 9 หลายเดือนก่อน +1

      @@lanchanoinguyen2914how come? What does one thing do with the other?

  • @jokered1133
    @jokered1133 9 หลายเดือนก่อน +20

    I don’t think I’ve seen anything that talk about this stuff in this much detail, much respect for your career choice, I am truly amazed at the information dump and how accessible you’ve made it, thank you.

  • @thyroid99
    @thyroid99 9 หลายเดือนก่อน +33

    From one 20+ year dev to another, your content is solid. And thanks for mentioning ATI!

  • @satana8157
    @satana8157 4 หลายเดือนก่อน +138

    Minecraft doesn't have LODs and its performance is embarrassing for the pixel art level of graphics it has.

    • @Draganox25
      @Draganox25 4 หลายเดือนก่อน +12

      If you're on pc, you can get the distant horizons mod that adds lods

    • @satana8157
      @satana8157 4 หลายเดือนก่อน +6

      @@Draganox25 Yeah I'm aware of the mod but it seems like it's not polished yet.

    • @Draganox25
      @Draganox25 4 หลายเดือนก่อน +4

      @satana8157 idk why you think that I use it and it looks fine to me let's me have a render distance of like 300 but only render like 32 chunks

    • @satana8157
      @satana8157 4 หลายเดือนก่อน +6

      ​@@Draganox25 I've seen some buggy videos, Maybe they fixed them then.
      My system is low end. Can it improve chunk loading speed if I keep the render distance pretty low?

    • @lemonlordminecraft
      @lemonlordminecraft 4 หลายเดือนก่อน +2

      Feel like getting the mod and trying it out can’t hurt

  • @rileymoore7025
    @rileymoore7025 9 หลายเดือนก่อน +38

    One iconic example of billboards is the infamous 1000 Heartless fight in Kingdom Hearts 2. Back then, it would've been impossible to have 1000 entities individually moving and acting all at the same time. Square Enix's work around to this was to have only have a handful of active enemies actually nearby. Meanwhile, the rest of the Heartless would be represented by these 'billboards'. While in combat, it's hard to notice this detail at first, but it's extremely obvious on repeat playthroughs. This work around is also present when there's a swarm of Rapid Thrusters, except it's a lot less noticeable since the enemies are flying above you and often spawn offscreen rather than right in front of the player.

    • @Wutwut1n1
      @Wutwut1n1 4 หลายเดือนก่อน +3

      Just watched that, good example 👍

    • @BlindBosnian
      @BlindBosnian 4 หลายเดือนก่อน +2

      It's similar with the cabin fight in the RE4 remake. You will see the ganado horde on the outside of the cabin, but they don't react to your shots and only a small amount of ganados is trying to enter the cabin at a time

    • @GeorgeTsiros
      @GeorgeTsiros 2 หลายเดือนก่อน

      didn't serious sam support hundreds of active models back in 2001?

  • @PaulSpades
    @PaulSpades 9 หลายเดือนก่อน +48

    Amazing. Not only does this explain why intuition fails, but you also back up the technical reasoning with real industry solutions.
    Watching your videos, I somehow always come out with more information than I was expecting. Well done!

  • @Kolyasisan
    @Kolyasisan 9 หลายเดือนก่อน +18

    The moment I saw the 2x2 grid in the thumbnail and you mentioned LODs I knew exactly that this is gonna be about workflow scheduling for fragment shaders. Great video as always

  • @comatose3788
    @comatose3788 8 หลายเดือนก่อน +3

    Very well done. Even covered some stuff that never gets covered, like how billboards are used along with LOD and occlusion culling. They never talk about billboards. First time I made a model doing all this, I was floored at how well it worked. This stuff changed everything back in the day. You also said one of my favorite words, lol ... Automagically.

  • @pizzamonkey7801
    @pizzamonkey7801 9 หลายเดือนก่อน +19

    this channel is insanely underrated. The amount of knowledge you offer with each video is absolutely great. Pleas keep this up.

  • @AyushBakshi
    @AyushBakshi 9 หลายเดือนก่อน +10

    Creators like you kept me motivated and today I'm a 3D and tech artist in my team. I'm not hardcore into graphics programming (yet) but I'm learning! One baby step at a time.

  • @TXanders
    @TXanders 9 หลายเดือนก่อน +11

    Finally! I no longer have to explain this every other day. Another brilliant coverage, well executed.

    • @Inferryu
      @Inferryu 9 หลายเดือนก่อน +3

      I mean, you'll still have to link to it every other day :D

  • @bhupesh_singh
    @bhupesh_singh 9 หลายเดือนก่อน +11

    my adblocker works completely fine, but for your videos I always pause my adblockers so that I watch the ads and support your wonderful content ✨✨

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +3

      Hah thanks so much! Very appreciated :)

  • @DKannji
    @DKannji 9 หลายเดือนก่อน +4

    In intro to 3D modelling we were told very sternly, "keep the triangle count to a minimum, and remove as many unnecessary triangles as possible.
    So this is nothing new to my ears... but it is fun to listen to anyways.

  • @NeoToXo
    @NeoToXo 7 หลายเดือนก่อน +3

    I just found this through a reddit comment and I didn't knew how much I needed to know this. Thank so much for explaining. This is so valuable and I definetely will buy ad start with your game math course. Awesome stuff

  • @hoffer_moment
    @hoffer_moment 9 หลายเดือนก่อน +4

    been hoping for content like this for many years, great job

  • @nebuchadnezzer2436
    @nebuchadnezzer2436 9 หลายเดือนก่อน +2

    You just explained what is, really, a constant, complex technical process, in a way pretty much anyone can follow and understand, and that's more than can be said for a lot of teachers/professors...
    I knew, for instance, a good amount of what was covered here, just accumulated knowledge over the years of gaming and satisfying my curiosity, and guessed cranking up sheer volume of triangles would tank FPS pretty quickly... But now, I actually understand *why*...
    Thanks for the insight 😍

  • @gio3061
    @gio3061 9 หลายเดือนก่อน +18

    In 2004, I was 4. Feels like I should've been a graphics engineer and bought a house, instead of being a little child. Shame on me. Seems that I've destroyed my opportunities with this one simple mistake.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +7

      Bet you won't make that mistake again.

  • @CyberWolf755
    @CyberWolf755 9 หลายเดือนก่อน +16

    Amazing video. Really like when I stumble on such high quality content.
    Would really like a part 2 covering Nanite and maybe alternatives other people developed

  • @kusshh_xo
    @kusshh_xo 9 หลายเดือนก่อน +2

    This video is an absolute gem, Really thank you Simon for putting this up and making this so straightforward. Really got some deep insights about how GPU's work.

  • @mysparetime1541
    @mysparetime1541 9 หลายเดือนก่อน +1

    You have by far the best game optimization content I’ve come across, can really tell that you know what you’re talking about and not just repeating words said by someone else. Really love the way you explain things with just the right amount of information for you to be able to understand these things fundamentally and really grasp the mechanics 🙏🙏

  • @RedPandaTables
    @RedPandaTables 9 หลายเดือนก่อน +3

    Personally learn this naturally from playing Just Cause 2. Amazing game that use this extremely well. popping in/out is noticeable if you look for it but if casually playing it doesn't take you out of the moment.

  • @xlerb2286
    @xlerb2286 9 หลายเดือนก่อน +4

    Very informative. I'm not a game developer but I enjoy understanding some of the complexities. I didn't realize that small triangles were an issue. Not being familiar with how gpu's work at that level of detail I'd figured a triangle is a triangle.

  • @UsaraDark
    @UsaraDark 9 หลายเดือนก่อน +3

    These are the kinds of videos that inspire me to one day dive into the graphics side of computers. I've always wanted to touch shaders and 3D modeling, but it has always felt beyond my understanding. This helps.

  • @Purpial
    @Purpial 9 หลายเดือนก่อน +4

    This is so incredibly in-depth, I learned so much from this video

  • @miguelnobre9788
    @miguelnobre9788 9 หลายเดือนก่อน +5

    because of you i found Lexx, been trying to find it for the past 15-20 years :D thank you, subscription deserved from all the info (and the extra one)!!!

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +2

      May his divine shadow fall upon you.

    • @lhb82
      @lhb82 9 หลายเดือนก่อน

      @@simondev758 More people need to know about Lexx!

  • @benridesbikes6975
    @benridesbikes6975 7 หลายเดือนก่อน +2

    You are an excellent teacher, the language and presentation was so easy to process even though I am only tangentially related to the topic, thanks for making this!

  • @millerbyte
    @millerbyte 9 หลายเดือนก่อน +5

    This is fantastic, thank you Simon! Just bought both of your courses, looking forward to diving in.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน

      Hope you get a lot out of them!

  • @MantridJones
    @MantridJones 7 หลายเดือนก่อน +3

    03:34 You just made my day for putting in Kai from the best TV Show ever made!

  • @DevDunkStudio
    @DevDunkStudio 9 หลายเดือนก่อน +6

    This is an amazing video.
    Really good information in a short and clear video. And this is information that is not found too often on TH-cam.
    Much appreciated!

  • @Decodeish1
    @Decodeish1 9 หลายเดือนก่อน +1

    Thank you a ton for the captions, really made it much easier to follow! Great video :)

  • @meezemusic
    @meezemusic 9 หลายเดือนก่อน +9

    Simon! You are a saint! You even reference everything you mention. S tier content 👌

  • @jumpsneak
    @jumpsneak 2 หลายเดือนก่อน

    I am thankful to my brain for deciding to click on this video after looking at the thumbnail for 10 seconds.
    This is one of the most grateful ways I discovered a brilliant new channel. Amazing work!

  • @siristhedragon
    @siristhedragon 9 หลายเดือนก่อน +16

    This flipped a switch in my head; this all makes WAY more sense now!
    Thank you!

    • @crackedemerald4930
      @crackedemerald4930 9 หลายเดือนก่อน +2

      Good luck on your ventures, non-binary black horned dragon dev! Take it ez! 😎 👍

    • @publicalias8172
      @publicalias8172 9 หลายเดือนก่อน

      delete this @@crackedemerald4930

  • @firun2635
    @firun2635 9 หลายเดือนก่อน +1

    The Crew Motorfest uses imposters (or, as I got to know them many years ago, sprites) for trees when you're using the map and zoom out. It's a nice throwback to when I started PC gaming.

  • @TheNumberOfTheBeast666
    @TheNumberOfTheBeast666 9 หลายเดือนก่อน +5

    Rare Lexx reference! Don't think some of us didn't spot that.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +3

      May his divine shadow fall upon you.

    • @lhb82
      @lhb82 9 หลายเดือนก่อน

      No mentioning of Lexx will go unnoticed ;D

    • @sean7221
      @sean7221 5 หลายเดือนก่อน

      Hell yeah was searching for this comment!

  • @nameno7032
    @nameno7032 5 หลายเดือนก่อน

    Please have more content in depth like this, no one ever talk about this deep, Thank you for your clear explaination

  • @danolantern6030
    @danolantern6030 9 หลายเดือนก่อน +3

    The best example i can call off from memory is GMOD’s Flattywood sign. I always thought it was a model from the distance it was. It isn’t.

  • @sorialexandre
    @sorialexandre 9 หลายเดือนก่อน +2

    It blows my mind... one more day that I realize I know nothing. Thank you very much!! Perfect explanation and quality like always.

  • @FlipOfficial
    @FlipOfficial 9 หลายเดือนก่อน +2

    This is a joy to watch! Glad I found this video 🙏🏼

  • @EmersonPeters
    @EmersonPeters 7 หลายเดือนก่อน +2

    Thank you so so so much for this, this is extremely helpful!

  • @socks2423
    @socks2423 6 หลายเดือนก่อน +4

    My takeaway. Please let me know if I'm wrong: small triangles are less efficient with their quads and thus are inherently more inefficient but the main reason they are inefficient is that the triangle assembly is a linear process that assums you will have less triangles than pixels and gets bogged down when it can't keep up with the assembly load. So the bottleneck is the assembly.

  • @Nebb_
    @Nebb_ 9 หลายเดือนก่อน +1

    Very interesting video, I’ve always wondered why more/smaller verticies were taxing on preformace and I’m glad that you made a video explaining it in a easy to understand manner

  • @Backup1982
    @Backup1982 9 หลายเดือนก่อน +27

    Thanks for this deep dive. You should use frame time instead of FPS on your plots though. As FPS is a reciprocal and thus makes reading the plot much harder.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +39

      Yeah, I had to choose 1 of 2 ways. I try to err on the side of caution, and go with what I *think* more people will understand intuitively.
      Go with fps, that's broad, less technical people will understand this but at the annoyance of technical people. The technical people should still, in theory, understand just fine though.
      Go with frametime, you end up potentially confusing less technical people, but it's a more straightforward metric for technical people.
      Either way, you're wrong for a % of people, unless you spend extra video time explaining your choice.

    • @Robloxtopfive
      @Robloxtopfive 9 หลายเดือนก่อน +2

      @@simondev758 Funny that you mention Humus in the video, I read the FPS vs. framerate post only today. It annoys him so much in scientific settings but for the average person 60 FPS is more intuitive than 16.666 MS frame time. I'm trying to benchmark my GLSL shader library, in your experience Simon, any other pitfalls when benchmarking fragment shaders or presenting results in an academic setting? Love the videos by the way, you got me to recreate a OSRS demo in Three.js.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +1

      No amazing advice off the top of my head, only to think about what audience you're presenting to.

    • @Robloxtopfive
      @Robloxtopfive 9 หลายเดือนก่อน

      @@simondev758 Thanks, that’s a fair point, I may actually watch tutorial videos on shaders to see how they’re explained. I’m presenting to those with a broad knowledge of CS but not necessarily anything graphics related.

  • @ch3dsmaxuser
    @ch3dsmaxuser 9 หลายเดือนก่อน +2

    Dude, this video is awesome! You have such a straightforward way to express your knowledge that I, not a graphics programmer, get it. Kudos!

  • @ilya238
    @ilya238 9 หลายเดือนก่อน +3

    Thanks, this was actually really interesting and helpful!

  • @ThatTrueCJ201
    @ThatTrueCJ201 9 หลายเดือนก่อน +2

    One thing I would love you to touch on is mesh shaders, which upon a quick search, allows developers to send geometry data to the rasterizer directly. This technology isn't common place yet, but seems like an alternative to the primitive assembly and will probably grow with popularity, as all modern consoles support this.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน

      I hope to do a video on them one day

  • @sabrina0013
    @sabrina0013 9 หลายเดือนก่อน +4

    This is fascinating, and very helpful. Thank you!

  • @Zenairo
    @Zenairo 9 หลายเดือนก่อน +1

    You're amazing, glad to have found this channel. Thank you for this easy to understand and useful information!

  • @mikzart
    @mikzart 9 หลายเดือนก่อน +5

    So interesting. Thanks for your work, I really appreciate it!

    • @mikzart
      @mikzart 9 หลายเดือนก่อน +1

      I will watch it several times with taking notes and watch other resources to create understanding

  • @sporefergieboy10
    @sporefergieboy10 9 หลายเดือนก่อน +1

    I’m so sad to hear about your esophagus cancer 😢. You’re my favorite youtuber I hope you get better soon!

  • @PrismaticaDev
    @PrismaticaDev 9 หลายเดือนก่อน +4

    Great video :) This will be my go-to when people ask about mesh-related optimization!

  • @herlantmajor5883
    @herlantmajor5883 9 หลายเดือนก่อน +1

    Amazing video as always man! I have heard about this on the surface in my career, never found something that covers this subject with so much ease before, thanks

  • @antoine9765
    @antoine9765 9 หลายเดือนก่อน +4

    For reasons unclear to me, I find 2d "fake" trees being called impostors extremely funny.

  • @lemonjumpsofficial
    @lemonjumpsofficial 9 หลายเดือนก่อน +1

    so far the best ones I've seen were done using a shader, 2 depth maps and 2images.
    it's a custom thing players do for vrchat, and it's so good, that it looks like the actual object just with couple of glitches.
    the genius of it is that it can encode essentially a 3d video. so it's used for stage performances for low power headsets such as oculus rift.

  • @sheridancarter78
    @sheridancarter78 9 หลายเดือนก่อน +3

    this is a really good video and also i really like your voice. thank you for making this 😊

  • @あき-h6u
    @あき-h6u 4 หลายเดือนก่อน +2

    This voice is so recognizable that I only watched 1 video a year ago after stumbling on it while on my game dev journey
    Now, another video pops up suddenly and I click on it, just to recognize the voice immediately 😂

  • @ivanalantiev2397
    @ivanalantiev2397 9 หลายเดือนก่อน +9

    What a great video, I kinda always wanted to understand the performance optimization techniques better, but just didn't really had time to do the digging. Thank you for your effort!

  • @Iridium.
    @Iridium. 3 หลายเดือนก่อน

    Absolutely fantastic video ! Now I finally understand understood how triangle size and shape affects performance ! Although in my journey in graphics , the post fx has always been more expensive than just triangles

  • @SpadesNeil
    @SpadesNeil 9 หลายเดือนก่อน +6

    Someone show this to Bohemia devs.

  • @ludologian
    @ludologian 9 หลายเดือนก่อน +1

    thanks for sharing as a solo gamedev who do read scientific research papers on the nitty gritty details but done nothing impressive I really appreciate simpler explanation that comes from experienced developer. thx again

    • @ludologian
      @ludologian 9 หลายเดือนก่อน

      I'm also interested in learning about middlewares like sinplygon and unitys scriptable rendering pipeline.
      I.e visibility buffer, forward+ and bindless texture

  • @SchadenfreudeUY
    @SchadenfreudeUY 9 หลายเดือนก่อน +3

    1:23 my guy was lowkey hitting that shit, look at him

  • @Shabazza84
    @Shabazza84 9 หลายเดือนก่อน +2

    Absolutely love the way you explain things.

  • @makebreakrepeat
    @makebreakrepeat 9 หลายเดือนก่อน +8

    Stay optimistic fellow devs!

  • @vazquezsebastian9764
    @vazquezsebastian9764 9 หลายเดือนก่อน +2

    Great video, very technical but clear witch is really great ! If you like it, keep up the good work

  • @ColinPaddock
    @ColinPaddock 9 หลายเดือนก่อน +5

    Back in the ‘90s, Marathon used “imposters” for all of the moving characters. They were all sprites. Animated sprites with versions from 8 separate angles.

  • @PureLogic7121
    @PureLogic7121 4 หลายเดือนก่อน

    as a aspiring game dev with multiple game devs under my leadership. I HIGHLY appreciate the information. ITS GOLDEN. If i ever get any funds to throw away your getting the first bit of it.

  • @zawa5243
    @zawa5243 9 หลายเดือนก่อน +12

    Thank you Bob from Bobs burgers for explaining graphics optimization

  • @MargudnRoboter
    @MargudnRoboter 9 หลายเดือนก่อน +1

    Thank you for this Video. This is such an eye opening thing! I would have never assumed that the rasterizer expects triangles that are atleast 4x4 pixels. I have always assumed, that is was a simple less stuff render

  • @p529.
    @p529. 9 หลายเดือนก่อน +3

    Crazy informative video! Big props

  • @kawashirov
    @kawashirov 9 หลายเดือนก่อน +2

    Each time someone in VRChat says their half-milion triangles avatar is not laggy, give them this video. Because (beside pretty obious and everywhere well-explained problem with vram, material slots, batching) there is no LODs for avatars and that's exactly what happening with their every detailed button on clothes or fluff on tail when they 10+ meters far from you: ITS JUST LESS THAN PIXELS, BUT ASSEMBLER HAD TO PROCESS THAT TINY UNSEEN SHIT AND PASS TO THAT 2x2 CLUSTERS TO RUN THAT LAGGY POYOMI FRAGMENT SHADER MULTIPLE TIMES PER-PIXEL JUST TO DISCARD OUTPUT.

  • @JackPotniy
    @JackPotniy 9 หลายเดือนก่อน +3

    This is golden content.

  • @aabbccddeeffgg1234
    @aabbccddeeffgg1234 2 หลายเดือนก่อน

    hey, nice video. This reminded me of the game overgrowth, that game have a unique solution for the lower quality objects on longer distance, that game dev invented an algorithm that smoothly reduces the amount of polygons of objects the longer the distance it is from camera, this reduced the amount of work needed for any artists as only need 1 version of each object and also increased optimization as the game dont need to keep loading new objects all the time. The dev shows it off it in his alpha video 206, not heard of other games using same solution.

  • @sanderwrong9106
    @sanderwrong9106 9 หลายเดือนก่อน +6

    Thank you, Bob's Burgers

  • @isogash
    @isogash หลายเดือนก่อน

    Top notch video, really helped me fill some gaps in understanding quad overdraw and nanite!

  • @Memeieli
    @Memeieli 9 หลายเดือนก่อน +3

    I also wanted to give some attention to Nanotech, a Unity version of Nanite nearing release.

  • @TheDarkestPaladin
    @TheDarkestPaladin 9 หลายเดือนก่อน +2

    Great video, even tho I doubt I'll be working as a game dev. I am always happy to learn more about optimisation methods, who knows I might make my game as a small hobby passion project on the side

  • @IAm18PercentCarbon
    @IAm18PercentCarbon 9 หลายเดือนก่อน +4

    This sounds like H. Jon Benjamin is teaching me computer graphics, and it's amazing.
    Great explanation of 2x2 quads, friend! I'd never really understood how to optimize for them before and I'm already excited to apply some of these tips.

    • @gogbone
      @gogbone 9 หลายเดือนก่อน

      i was thinking the same thing lmfao

  • @corporal381
    @corporal381 9 หลายเดือนก่อน +2

    You have a wonderful Bobs Burgers deadpan delivery that just works.

  • @preston7309
    @preston7309 9 หลายเดือนก่อน +7

    "Imposters"
    get out of my head
    get out of my head
    get out of my head

  • @SloppyPuppy
    @SloppyPuppy 9 หลายเดือนก่อน +2

    Just love the John Carmack quote on the article 😂

  • @chrismcelligottpark6416
    @chrismcelligottpark6416 9 หลายเดือนก่อน +4

    Love this - this is not at all how I think of the gpu pipeline, so this way eye opening. It would be very interesting to hear about instancing and batches in the context of all this.
    For example, I presume that even though instancing saves a bunch of one kind of work, it does not save any work when it comes to this step. Or maybe I’m wrong!
    And I’ve always heard that batch count, or alternatively setpass call count, was the biggest bottleneck.
    So for something with a ton of different impostors or LODs, that might really reduce the triangle count on screen, but could double or quadruple the batch count. Obviously tuning is a series of tradeoffs, but I’m very curious on your take on this.

    • @simondev758
      @simondev758  9 หลายเดือนก่อน +4

      Instancing mostly saves you on that "driver" step, in that you don't have to make X calls to the API in order to draw X copies of an object.
      Performance for GPU's is complex, simple rules like watching your batch count and using LOD's work, but they're just that, simple rules that are meant to cover up the complexity.

    • @lanchanoinguyen2914
      @lanchanoinguyen2914 9 หลายเดือนก่อน +1

      Of course wasted draw calls affects both cpu and gpu performance.When cpu has performance issues,it's much worse than gpu because you can notice the dramatic fps drop.Draw calls are your enemy not tiny triangles because draw calls *start* the entire rendering pipeline not triangles,it's simple like that.Drawing one LOD at a time doesn't increase any draw call at all logically.

  • @MatBat__
    @MatBat__ 9 หลายเดือนก่อน +2

    Dude, amazing content. How interesting!
    Thx for sharing your knowledge, cheers!

  • @totheknee
    @totheknee 9 หลายเดือนก่อน +3

    This is good info. Mindless gamers fall for the mythology of "GPU is a God and can do everything faster than a CPU. CPU graphics are for noobs and old people." And yet, a software rasterizer can be 3x faster than a GPU (21:42). And I've seen another example of a software rasterizer that gets over 500 FPS. My entire engine is software/CPU only (except the OS probably blits the final image using a GPU), without any optimizations so far, and it runs faster than a human can perceive the frame rate. I wish more devs would utilize 8 or 16 CPU cores and stop blindly worshipping the GPU God so much. Instead, we get stuck with the "moar corez doesn't matter, devs doesn't know how 2 prgram teh moer corz!" propaganda myth.

    • @jcm2606
      @jcm2606 3 หลายเดือนก่อน +1

      In this context, software doesn't mean running on the CPU, rather it means that Epic wrote their own rasteriser in a set of compute shaders that bypasses the GPUs built-in rasterisation hardware. It's still running on the GPU due to using compute shaders, but it's using software to do the heavy lifting rather than hardware.

  • @Gwizz1027
    @Gwizz1027 9 หลายเดือนก่อน +2

    I really enjoy learning about optimization from Bob Belcher

  • @ThePigasaurus
    @ThePigasaurus 4 หลายเดือนก่อน +10

    You sound like Bob burger

    • @Mikee512
      @Mikee512 หลายเดือนก่อน +2

      You sound like keyboard key clacking

  • @irongiantftw6295
    @irongiantftw6295 25 วันที่ผ่านมา

    1:30 A friend of mine realized a favorite game of her is actual partial horror just because they used a drastic level of simplicity.
    I'm talking about Genshin Impact. When the player is far away enough, it turns the trees not into simple 3D objects, but in images, images that always face you(the player model). In the game, there's a glyding function too, so you can slowly descend from high places. You can use this to be high up and slowly see the trees loading it's 3D counterpart. But before that, you can see a tree dancing around you in it's png form when you wiggle in the around high above it.