Microsoft's New AI: Virtual Humans Became Real! 🤯

  • Published on Jul 23, 2024
  • ❤️ Check out Runway and try it for free here: runwayml.com/papers/
    📝 The paper "3D Face Reconstruction with Dense Landmarks" is available here:
    microsoft.github.io/DenseLand...
    🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
    Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Ivo Galic, Jace O'Brien, Jack Lukic, John Le, Jonas, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Michael Albrecht, Michael Tedder, Nevin Spoljaric, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.
    If you wish to appear here or pick up other perks, click here: / twominutepapers
    Chapters:
    0:00 - Teaser
    0:19 - Use virtual worlds!
    0:39 - Is that a good idea?
    1:28 - Does this really work?
    1:51 - Now 10 times more!
    2:13 - Previous method
    2:35 - New method
    3:15 - It gets better!
    3:52 - From simulation to reality
    4:35 - "Gloves"
    5:07 - How fast is it?
    5:35 - VS Apple's ARKit
    6:25 - Application to DeepFakes
    Thumbnail background design: Felícia Zsolnai-Fehér - felicia.hu
    Károly Zsolnai-Fehér's links:
    Instagram: / twominutepapers
    Twitter: / twominutepapers
    Web: cg.tuwien.ac.at/~zsolnai/
  • Science & Technology

Comments • 635

  • @gogokowai 1 year ago +304

    "...even the movement of our nostrils will be translated. What a time to be alive!" - Dr. Károly Zsolnai-Fehér, nose enthusiast

    • @user-wq9mw2xz3j 1 year ago +1

      or a time to not be alive...

    • @4hm35319hd0h5 1 year ago +2

      @@user-wq9mw2xz3j To phrase it slightly less darkly, 'What a time to be virtually alive!'.

    • @raelrs 1 year ago

      Please let it not track what’s coming out of the nose during winter… I’d prefer to be blissfully ignorant of that at least in VR! ;)

    • @emhgarlyyeung 1 year ago +1

      I just hope this technique won't be weaponized by some evil government to make fake things that smear others; instead, it should be used in a good and fun way.

    • @rakijr9176 1 year ago +2

      So that's how his name is spelled.

  • @dundermifflinity 1 year ago +645

    Is there a better channel that demonstrates bleeding-edge technology?
    No, this is it. If you’re not subscribed, you’re missing out.

    • @TwoMinutePapers 1 year ago +78

      You are too kind Phil, thank you so much! 🙏

    • @dundermifflinity 1 year ago +56

      @@TwoMinutePapers no, thank you. I mean it. Nobody else does it like Two Minute Papers.

    • @Rodutchi 1 year ago +7

      Absolutely correct

    • @lubbyLB 1 year ago +8

      But he didn't talk about Stable Diffusion yet.

    • @dannyarcher6370 1 year ago +5

      I have not seen one. If you want to get ahead of the competition, especially when it comes to multimedia, just follow this channel to see what's coming down the pike.

  • @95TurboSol 1 year ago +207

    Wow, can you imagine movies or games based on historical figures being reproduced by photoreal 3D actors? This will be wild. It's also terrifying that AI programs will probably be used by people to recreate and interact with digital versions of their dead spouses or children; this might mess some people's heads up.

    • @Rctdcttecededtef 1 year ago +18

      Imagine if we could somehow train an AI on popular literature and have it recreate the author's personality.

    • @Numbabu 1 year ago +12

      @@Rctdcttecededtef I mean it can already imitate the writing style they used in their books, if you have a bunch of personal writings from a person I don’t see a reason you couldn’t generate text that matches them.

    • @GreatMCGamer 1 year ago +8

      If you ask me...
      In order to lose your sanity due to virtual world, it must already be lost.
      Most commonly, being superstitious and or religious.

    • @Rctdcttecededtef 1 year ago

      @@Numbabu that's a much better explanation of what I was imagining

    • @EliIud 1 year ago +1

      @@GreatMCGamer that's cool but what if you're brought up under this tech since 0 years of age

  • @Druew 1 year ago +14

    We'll need a minimum of 10 more years of training for it to capture Jim Carrey's expressions accurately. It'll most likely be way too confused.

    • @Dilligff 1 year ago +1

      He'll be the stress test for the technology. Kind of like the meme 'can it run Cryengine?', but it'll be 'but can it do Jim Carrey?'.

  • @BossKillRatio 1 year ago +49

    5:40 You could use this to insert yourself into a game and actually talk as yourself (not just for VR)... Imagine having virtual conversations with other players in a third-person online game, where other players talk realistically.

    • @Jack-vv7zb 1 year ago +9

      Star citizen already does this. Called face over IP (FOIP)

    • @maiskorrel 1 year ago

      I can guarantee you that people will just meme the shit out of each other

    • @sevens3383 1 year ago

      Player customization will be more advanced soon hopefully

    • @knockdoun 1 year ago

      Star Citizen has this. It's a little outdated, but they could totally update it with an improved version of something like this. The current face tracking is goofy, but it does work.

    • @Hundr_ 1 year ago +1

      Wow a new tool to commit fraud with!

  • @algorithminc.8850 1 year ago +35

    Love this channel. Wish more were as upbeat as yours ... Cheers.

    • @rick_vito 1 year ago +1

      make one!

    • @aronhighgrove4100 1 year ago

      It sounds slightly hyperbolic and too whispery/aspirated at times though. It's a bit distracting when you constantly feel like it's a bit ad like.

  • @Skulll9000 1 year ago +3

    3:38 I like how confused the AI is when the lips are behind the tea cup.
    Input Video: "Siiip"
    AI: "schlop schlop schlop?"

  • @MarkArandjus 1 year ago +21

    If only this was on the market last year when I spent 6 intense months tracking dots on faces on mo-cap actors for a video game!
    There was so much manual cleanup and fixing because the captured facecam data wasn't always great, it was a lot like 2:19

    • @spork4966 1 year ago

      I want to do what you do when I grow up, but I don't know what you are doing and where I can start 😂

    • @janimelender2674 1 year ago +1

      Well here's your chance to do something with this paper. Create a company and offer your skills in facial tracking. If you're cheaper than you are right now, you'll make big bucks!
      In time you could become very proficient in this type of motion capture. Let's say it's weird with capturing blinking. Well "just" write an algo that detects when the top and bottom of the eye-lids are close to each other, in order to detect blinking. Then delete the blinking data from ~0.3 seconds before that detection until ~0.3 seconds after, and replace it with a standard blinking motion. Stuff like that will be worth gold, since you'll be able to do so much more work so much faster than industry standards.
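
The blink cleanup idea above can be sketched roughly as follows. This is only an illustration of the commenter's suggestion, not anything from the paper: the per-frame `eye_openness` signal (a normalized eyelid gap where 1.0 means fully open), the `closed_below` threshold, and the linear "standard blink" curve are all assumptions made up for the example.

```python
from typing import List


def find_blink_frames(eye_openness: List[float], closed_below: float = 0.2) -> List[int]:
    """Return the frame indices where the eyelid gap drops below the threshold."""
    return [i for i, gap in enumerate(eye_openness) if gap < closed_below]


def standard_blink(n: int, open_value: float) -> List[float]:
    """A canned blink over n frames: linear close to 0 at the midpoint, then linear reopen."""
    if n == 1:
        return [0.0]
    mid = (n - 1) / 2
    return [open_value * abs(i - mid) / mid for i in range(n)]


def replace_blinks(
    eye_openness: List[float],
    fps: float = 30.0,
    pad_seconds: float = 0.3,   # the ~0.3 s margin mentioned in the comment
    closed_below: float = 0.2,  # hypothetical "eyelids are close" threshold
    open_value: float = 1.0,
) -> List[float]:
    """Overwrite the captured data around each detected blink with the canned blink."""
    pad = round(pad_seconds * fps)
    cleaned = list(eye_openness)
    last = len(cleaned) - 1

    # Build padded windows around closed frames, merging windows that touch or overlap.
    windows: List[List[int]] = []
    for i in find_blink_frames(eye_openness, closed_below):
        start, end = max(0, i - pad), min(last, i + pad)
        if windows and start <= windows[-1][1] + 1:
            windows[-1][1] = max(windows[-1][1], end)
        else:
            windows.append([start, end])

    # Replace each window with a clean blink of the same length.
    for start, end in windows:
        cleaned[start:end + 1] = standard_blink(end - start + 1, open_value)
    return cleaned
```

A real pipeline would more likely blend the canned motion into the neighbouring frames than overwrite them outright; the hard replacement here just mirrors the comment's description.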

  • @SymbolCymbals2356 4 months ago +1

    “But can it track Jim Carrey” feels like the “but can it run Crysis” of face tracking solutions lol

  • @TwoMinutePapers 1 year ago +20

    Dear Fellow Scholars, we have a new sponsor in our ranks! Make sure to try their amazing AI-assisted editor here: runwayml.com/papers/

    • @bj124u14 1 year ago +1

      Sponsorship WOOOOO!

  • @DataJuggler 1 year ago +11

    This could take animation to another level. Nvidia's Audio2Face can already take text and generate emotions. Avatars that are indistinguishable from humans are probably not more than a year or two down the line.

    • @theamici 1 year ago +1

      indistinguishable through a casual look, sure, but algorithmically generated faces also generate data artifacts which can be traced by software

    • @MikkoRantalainen 1 year ago

      @@theamici Yes, totally perfect deepfakes are not yet close, but the "video feed" you send to that Teams meeting can easily be a virtual representation of yourself in perfect studio lighting and nice clothing, wherever you're actually attending from.

    • @Juanpopspacks 1 year ago

      @@MikkoRantalainen or a president is seen on video doing something or saying something he didn’t.
      Scary Barry

    • @MikkoRantalainen 1 year ago

      @@Juanpopspacks Yes, the general public should be educated that a video can lie worth 10000 words.

  • @jdmeesey 1 year ago +120

    This tech would be extremely useful for streaming: from Vtubers to facial tracking for green screens!
    I could also see this being used as a kind of encoder-decoder pair for telepresence applications, where one’s head and shoulders are sent as position data rather than a high resolution image, and then reconstructed on the receiver side for high speed, high fidelity, lighting insensitive video calls!

    • @livedandletdie 1 year ago +7

      It would also be useful to be used in the reverse for more accurate facial expressions of characters in video games. Collecting enough data, and turning it all into vector space, and thus being able to perform more human expressions.
      There have been a lot of games throughout the years that have promised "realistic" facial expressions, but none of them have been up to par; with this, they could quite easily use the data to improve that.
      Now the only thing that needs work is lighting of non-static objects.

    • @Cube_Box 1 year ago +1

      nice idea

    • @dundermifflinity 1 year ago +4

      That teleconference idea exists already - I think, if I remember correctly, by Nvidia. This channel did a video on it a couple of years ago.

    • @dundermifflinity 1 year ago +1

      Here it is th-cam.com/video/dVa1xRaHTA0/w-d-xo.html

    • @etaashmathamsetty7399 1 year ago

      @@livedandletdie oh ur 100% right about that
      But we probably won't see it any time soon

  • @bj124u14 1 year ago +29

    The face tracking technology (without tracking points) is getting wild! The digital animation workflow is going to be sped up twofold in the next year! (Or at least, I think so.)

  • @cmilkau 1 year ago +2

    possible uses:
    motion capture
    facial animation
    facial sentiment detection (embedding, augmentation)
    facial recognition
    automated lip reading (embedding, augmentation)
    speech audio-to-animation (data augmentation)
    eye tracking

  • @Zebred2001 1 year ago +14

    I can't wait until you can walk around in an open-world game map in VR and engage any NPC in conversation and the AI will generate in it an actual personality with unique character traits - though the player will be able to set parameters barring types they don't want to run into!

    • @muhammedkoroglu6544 1 year ago +1

      Definitely what I hope will happen too!

    • @proloycodes 1 year ago +2

      so basically an echo chamber?

    • @Zebred2001 1 year ago +2

      @@proloycodes If you want to set your parameters for provocative conversations or constant arguments go right ahead!

    • @joevaghn457 1 year ago

      No, if the AI is smart enough to do that, you’ll have absolutely zero say in what its personality is. Lol

    • @mhb11 1 year ago

      Ladies and gentlemen, an ERP junkie.

  • @yorzengaming 1 year ago +13

    What a time to be *AI*

  • @luisca92 1 year ago +20

    This is literally another one of those papers that I've been waiting for. I work in animation (I got my degree in 2D animation), and this tool, along with various other things that are coming out, is going to revolutionize animation even more than it already has been.

    • @AhmedAli-kt1ez 1 year ago

      How does a 3D tool help with 2D? On a side note I doubt anything made by A.I will ever look as good as Akira or Ghibli stuff.

    • @cjeff99 1 year ago

      @@AhmedAli-kt1ez 3D models made to look 2D

    • @litjellyfish 1 year ago +1

      @@AhmedAli-kt1ez the data can easily be flattened to 2D. Actually they already are in the end

    • @lovejsouvzacne 1 year ago

      is there a way to export blender files from it or do you know how to use it ? :)

    • @litjellyfish 1 year ago

      @@lovejsouvzacne This is a real-time image-to-facial-feature method. I don’t think it handles the mesh generation. Even if it did, the output would most likely be in an industry-standard format and not Blender's format.

  • @bunkerputt 1 year ago +1

    VR team meetings where you don't need a fake avatar.

  • @schrimpium9450 1 year ago +6

    Mind blowing.

  • @juliandarley 1 year ago +85

    extremely impressive. is this facial performance capture now good enough for real movie-making? also it would be very useful to see the motion being targeted onto animations, especially animals.

    • @markusgp 1 year ago +7

      Probably not if you can't tweak the results but it won't be long

    • @dan_loup 1 year ago +13

      It's actually even more impressive than what was stated in the video, because it's not only running on the CPU, it's running on one CPU core

    • @fabriperoconalgomasytodojunto 1 year ago +2

      If they gave it depth too then pretty sure it would

    • @jensenraylight8011 1 year ago +3

      The results from motion capture are often jagged, and they get baked into every frame.
      It's hard to clean up, unless someone invents a way to smooth out the jaggedness in mocap.

    • @juliandarley 1 year ago +4

      @@jensenraylight8011 Yes, cleaning up mocap has always been a total nightmare: very expensive and very tedious. If AI can clean up mocap and get rid of the unnatural jinks and jumps, then that would be a huge leap forward. Recently, I have seen improvements, especially in fixing foot sliding. Maybe Károly will find a paper that finally fixes messy mocap.

  • @johnclark926 1 year ago +3

    I wonder if expensive stuff you need for mocap will have their prices go down once these AIs reach a point where they’re on par and available to the general public

    • @danielmonge2318 1 year ago +2

      The same thing is going to happen that happened with the GPS market. It used to be very expensive to own one, and there was a yearly fee for updated maps. Now we've got Google Maps for free on our phones!

  • @justanotherhotguy 1 year ago +14

    This is a huge W for the gaming industry.

  • @timogul 1 year ago +4

    You know what would be really cool on a testing program like this? Use some glass materials to block people's faces in various ways, and film them in visible light and in IR, and then let the AI work with the IR footage to figure things out, meaning that the person's face is occluded, but then you would have visual footage from the same angle that you could overlay, so that you could see what it got right and got wrong perfectly.

    • @maiskorrel 1 year ago

      I suppose that footage could also be used for training the AI to correct its mistakes.

    • @mhb11 1 year ago

      And then measure the gaps for every data-point it got wrong, and then optimize those gaps via feeding them as training data.
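
The "measure the gaps" step above amounts to a per-landmark distance between the predicted points and a trusted reference. A minimal sketch, assuming 2D landmarks given as (x, y) pixel pairs in matching order; the function names are hypothetical and this is not the paper's evaluation code:

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def per_landmark_error(predicted: List[Point], reference: List[Point]) -> List[float]:
    """Euclidean distance between each predicted landmark and its reference position."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must contain the same number of landmarks")
    return [math.dist(p, r) for p, r in zip(predicted, reference)]


def worst_landmarks(errors: List[float], k: int = 3) -> List[int]:
    """Indices of the k landmarks with the largest error, worst first."""
    return sorted(range(len(errors)), key=lambda i: errors[i], reverse=True)[:k]
```

The indices returned by `worst_landmarks` point at the frames or face regions that would be the most useful to feed back as additional training examples.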

  • @marshallross3373 1 year ago

    Really love your video summaries. Excellent work. Thanks for posting!

  • @c016smith52 1 year ago +2

    Always love the reveal with "hold onto your papers!" or even the SqUeEzE your papers on this one. :) Great content as always, thank you!

  • @SirusStarTV 1 year ago +1

    Even slight misplacement of virtual face from real face is noticed by our brains

  • @gljames24 1 year ago +2

    This is going to be great for XR applications!

  • @Squidbush8563 1 year ago +3

    Combine this tech with 3D animation, voice synthesis, AI script writing, AI story creation, AI-generated artwork, targeted advertising, and cameras in theatres to gauge response and interest.
    We can have "Hollywood in a box"
    and potentially add the tech of Brain Machine Interfacing,(the first of which has become commercially available very recently) we could even eliminate theatres and get better response from the audience.
    Of course, we all know what this will ACTUALLY be used for...

    • @snaphaan5049 1 year ago

      Add to that CRISPR babies and the enormous benefits it has and in a couple of generations human beings as we know them will be a thing of the past. It's weird, almost every tech development , from self driving cars, boston dynamic's robots, midjourney AI, 3D printing etc is replacing physical labour. It's like man's answer to the Garden of Eden. We don't need God to go back there. We can counter "in the sweat of your brow" with and machine server power.
      The writing is on the wall. I mean, when you have Silicon Valley entrepreneurs waving off normal human beings as a burden and glorifying a "new designed humanity", then we are pretty much damned.

  • @NeuralSensei 1 year ago +3

    This is great because image-based tracking is always cheaper than any special sensors. Now nobody will need expensive headsets or iPhone sensors for good face tracking.

  • @Mizrob10 1 year ago +1

    Your enthusiasm is infectious.

  • @kebakent 1 year ago

    Does anyone know if this is available to the public? Like, can I download and run this model with the pretrained weights?

  • @MelissaAtwell 1 year ago +3

    Wow, that’s amazing. ARKit still does some things better, because of the depth data. For example, in those side-by-side videos, you can see that ARKit better tracks the lips, around the eyes, and nose/nostrils. It’s the “nooks & crannies” of our faces that depth sensing can better interpret. The color algorithms are getting better & better though! They are still just a little more jiggly compared to color + depth. Thanks for sharing this cool video!

  • @altf4216 1 year ago +2

    because this works so well with obstructions, this might be useful for vrchat in the future...

  • @CoudyGeek25 1 year ago +1

    You could use this for real-time augmented reality overlays of someone's virtual avatar over their face; that could be pretty cool.

  • @randomhuman1965 1 year ago +1

    You make me squeeze the papers so hard they turned into Diamane

  • @redandpigradioshows 1 year ago +2

    Started with the uncanny valley in the simulations and then somehow got ever more unsettling with the wireframe robo-masks

  • @Cheesecannon25 1 year ago +2

    I'm excited to see this easier&higher quality capture in Vtubing

  • @Ninii0318 1 year ago +2

    This is outstanding, can't wait to try this new thing.

  • @JoaoPedro-ki7ct 1 year ago +1

    1:15 Baby wake up, a new NFT collection just dropped!

  • @openroomxyz 1 year ago +20

    Nice, nice. It seems like 2020-2030 will be the decade when AI becomes common, in a big way, in software running on desktops, laptops, and smartphones.
    It's time for a lot of cool new software that will increase productivity and enable us to make new things that were previously impossible to make.

    • @rettenthetetlen8759 1 year ago +1

      Great recession with global unemployment.

    • @zdenekburian1366 1 year ago

      @@rettenthetetlen8759 exactly, and ultimately a big war, but there is a hope, a communist revolution making obsolete money, market, wage work, exploitment, capital

  • @robfriedrich2822 1 year ago

    In the 1970s, German television had a movie, "Welt am Draht" (World on a Wire), in which computer experts created a virtual world and were able to enter it. One of the virtual beings found a way into the real world and said that this isn't the real world yet; in the end, the computer experts of the real world gave the technology to the virtual people.
    At the least, the concept of a virtual world is something where we go too far.

  • @lexibyday9504 1 year ago +7

    Whenever I watch one of these with virtual humans in it, I can't help feeling disappointed that we'll still have to wait so much longer for the same to be possible with virtual not-humans. One day, though, we will be able to get an AI to generate animated or realistic characters of any kind, from humans to fantasy creatures to aliens and beyond, all with the same algorithm. And maybe ten papers down the line that same AI will be able to study a video of Godzilla and create him. And one day all of this will lead to the destination I dream of.

    • @litjellyfish 1 year ago +1

      This is already possible today. It’s just a matter of which mesh you want to use as output. Instagram, Snapchat, etc. filters do this.

    • @joevaghn457 1 year ago

      Yeah… Those with the craziest fetishes will definitely be satisfied.

  • @Vansafe0 1 year ago +1

    Wonderful as always

  • @ethzero 1 year ago +3

    Said it before: we're right in the age that's making all the building blocks for what will one day be commonplace Holodeck-like technology.

  • @andrelecozvideographer9030 1 year ago

    any info on when this will become a product i can purchase? any clear date on the horizon?

  • @dh8956 1 year ago +2

    The fact, that news outlets are using this technology, is troubling.

  • @ZackThoreson 1 year ago +3

    We need good vr hand tracking!!

  • @alfredocalvimontes2488 1 year ago

    Is there a sample for testing, or is the source code available?

  • @allanh6076 1 year ago +2

    I see what pushes you forward, the rewrite of the Game of Thrones final series! I cannot wait

  • @tekoneiric 1 year ago +2

    I've thought for years that AI will eventually be able to break down movies and TV shows into their elements, including actor appearances, voices, and acting styles. This could be used as part of that by capturing the facial expressions of actors. One thing I see that could be improved is identifying the expression anchor points of different faces and people. Topographical maps aren't enough when it comes to faces. People typically have different points on their faces where expressions seem to radiate from. It's probably tied to either muscle anchor points or just how different people learn to control their facial muscles as they grow up.

  • @ryanloughlin3485 1 year ago +1

    Omg seeing Mona Lisa move like that was awesome! I love technology ❤️

  • @geordonworley5618 1 year ago +3

    Let's use it for VR remote conferences.

  • @ninhil2 1 year ago +3

    If it can track Jim Carrey, it can track everyone

  • @kirbmeister_ 1 year ago

    "what a time to be alive!" is my favorite catchphrase

  • @TheWolfgangplayer 1 year ago

    love your videos! amazing tech!

  • @charlesblithfield6182 1 year ago

    I love your delivery!

  • @SMASH_REVIEWS 1 year ago +1

    Signed up on Runway, excited to see what it can do.

  • @yorgle 1 year ago +12

    It's still super weird to me that you can train an AI on computer-generated data... it's like the computers can teach themselves based off of their own imaginations... Love it! And thank you!

    • @spideyninja 1 year ago +5

      It's important to mention that the entire dataset is procedurally generated, but not via an AI. Thus it's 100% precise.

    • @yorgle 1 year ago +1

      @@spideyninja Fair enough. :)

    • @cube2fox 1 year ago +1

      AlphaZero (Go software with superhuman performance) was also completely trained on synthetic data.

  • @SkyShazad 1 year ago

    so is this available to download yet to test?

  • @BlackShortCurley 1 year ago +1

    Look forward to converting scanned faces to a miniature model, so that players can literally play as themselves in DnD

  • @hr3nk 1 year ago +1

    I must say that generating training data from a virtual environment, like using footage from a virtual environment to train a driving AI, is something I hadn't heard of, yet it seems so obvious, considering that you get both structured and unstructured data distributions at the same time and very little human work is needed. If someone knows of papers on successful applications of virtual environments for training (outside of RL, of course; I'm talking more about CV), please link them here, I would really appreciate it!

  • @karolkornik 1 year ago

    Amazing work. I love it :*

  • @freedom_aint_free 1 year ago +12

    Let's be honest, folks: I know that y'all want a massive open-world RPG like Skyrim where every single NPC is lifelike and totally unique and will pass the Turing test blindfolded.

    • @elichapin3366 1 year ago +1

      You're completely right

    • @michaelleue7594 1 year ago +1

      I mean, yes, I do, but we are so far from that. Look at Evermore: it's basically a real life version of this, with real actors playing the parts of NPCs, and you *still* can't get anything approaching an authentic experience. Even *actual people* are no good at passing the Turing test in a fictional setting.

  • @jeremyccc 1 year ago

    Dr! I just saw you are presenting at GTC this year! Signed up immediately for your presentation when I saw your name

  • @JoshLathamTutorials 1 year ago

    I wasn't ready for the Two Minute Papers face reveal, but I'm glad i now have a face to the soothing voice!

  • @xRays6 1 year ago +2

    Get these virtual fellers hooked up with a gpt3 brain, stick em in gta v, and boom simulation confirmed

  • @joanz4811 1 year ago +1

    Regarding deep-fakes-.. Imagine this, having a photo of a relative or someone who's passed, or a dog, and preset movements- or ai to make them more life-like. Sortof like the pictures and paintings in Harry Potter. Digital frames already exist- but "magical" moving people frames don't. Fun idea for anyone savvy out there!

  • @Game_with_me-r6j 1 year ago +2

    I'm gonna use this in the Metaverse.

  • @huyked 1 year ago +1

    0:33 This makes me think that we are living in a simulation. Dang.

  • @Uhfgood 1 year ago +1

    almost makes me wonder if they couldn't just use existing data to determine what's behind the obstructions. If it can match a face to an existing model, and can position that over the tracking data, then use points on the model to realign those it can't see. So then it should match pretty well. Obviously it's less than trivial to program, but I think, the answer to tracking things that are obstructed is simply to use similar models and track them in the parts they can't see.

  • @Pauluz_The_Web_Gnome 1 year ago

    6:23 in sync with the narrator! :D Amaaaazing! lol

  • @MilMike 1 year ago

    6:40 and 7:00 - can we use this nowadays? or is this some concept? care to share a link or name of the git repository / paper? I love the possibility to move some face inside a photo using my own face

  • @pabloroldandirector 1 year ago

    Amazing! but i didn't get where I can find this tech to try it???

  • @smcclure3545 1 year ago +2

    So will we be seeing cameras auto-orienting a person's avatar so that they look like they're staring into the camera when it's actually off-center? I could see the system calculating the center of a screen-partner's face as the center of where a camera should be, and adjusting the avatar of the caller (so it seems to be oriented as though the camera were placed there). It could take the communication awkwardness out of video calls and teleconferences. Moreover, I think these interactions are like talking to a person who avoids eye contact, essentially undermining confidence and hindering the adoption of video conferencing at a psychological level.

    • @MikkoRantalainen 1 year ago

      If that's the only thing you want to fix, a half mirror in front of your camera and a teleprompter-like screen setup is all you need.
      However, if you want to show yourself in perfect lighting and some specific clothing and background, then you need the tech from this video.

  • @TundeEszlari 1 year ago +2

    The content turned out amazing, keep it up!❤

    • @Sekir80 1 year ago

      Not just content! It has something real to say. ;)

  • @NanoNutrino 1 year ago

    when is this available? Will it just be an app on a phone?

  • @roibuda9448 1 year ago

    Great content, thank you

  • @themightyflog 1 year ago

    How can we use this stuff?

  • @NontoxicRadiation 1 year ago +6

    I think we all know what people are going to use this for

  • @Plafintarr 1 year ago +1

    My goodness! Love it!

  • @META_mahn 1 year ago +1

    I can't hold onto my papers! My papers flew out of my hands before the video even started!

  • @qAngel 1 year ago +1

    can't believe humans are real now. terrifying.

  • @ricardoabh3242 1 year ago

    very interesting concept!

  • @JustWasted3HoursHere 1 year ago

    When I see this sort of thing I immediately think of the benefits it would have for visual effects in movies, specifically in regards to augmenting an actor's face with some CG _without even needing to place dots on their faces anymore!_ Think of the old TV shows like Star Trek The Next Generation where characters like Worf, the Klingon, had to endure 3 or more hours of makeup every single day. Using the above technology, Michael Dorn could just come into work wearing his Starfleet costume and the rest would be done in post! And the best part is that it would actually look BETTER. Even up close shots would likely be perfect, especially once this technology is further developed.
    What a time to be alive!

  • @Exilum
    @Exilum ปีที่แล้ว +6

    AI research always impresses me. As a dreamer myself, I see every new paper as a new tool that could be used for virtual worlds. Now I would love to see what AI research could do with discrete BCIs like Neuralink or Synchron. I feel like BCI research has been quite isolated from other fields outside of the startup world, and it could gain from exploring neural nets a bit more (especially with the results we saw from Neuralink).

  • @Perplexer1
    @Perplexer1 ปีที่แล้ว

    5:24 "Squeeze that paper" ...... When holding on to your papers just doesn't cut it anymore.

  • @stevesloan6775
    @stevesloan6775 ปีที่แล้ว

    I'd love to use it for facial tracking simulations, where the face is a real person's but rendered virtually by AI.
    I'm sure that could be a method that would create better facial capture tracks.

  • @curb_shifter
    @curb_shifter ปีที่แล้ว

    Has any extended research been done on lip reading?

  • @IsHekiel
    @IsHekiel ปีที่แล้ว

    The question is: where can I get it, and how do I use it in Blender and so on?

  • @envokazbendi
    @envokazbendi ปีที่แล้ว

    It turned out really good. Keep it up. I love your videos.

  • @seraimet
    @seraimet ปีที่แล้ว +2

    Wow, I was searching all over the internet for an appropriate topic on virtual costumes for filmmakers... wow, it's insane... this will be a serious plus for filmmakers' budgets... Waiting for Unreal Engine to implement all this stuff soon.

  • @NafanyaZX
    @NafanyaZX ปีที่แล้ว

    08:09 "instantly remove unwanted objects" removes humans 😂

  • @Ben-rz9cf
    @Ben-rz9cf ปีที่แล้ว +1

    Let's hope this is open source or it somehow finds its way over to the boys at Epic. This algorithm needs to be implemented in Live Link Face like, yesterday. Imagine MetaHumans with this level of mocap fidelity.

    • @joevaghn457
      @joevaghn457 ปีที่แล้ว

      It’s Microsoft, so who knows. They probably patented it, and likely won’t open source the most critical parts

  • @DRealHammer
    @DRealHammer ปีที่แล้ว

    But the technique seems to have some temporal inconsistency when occlusion happens. Would be awesome to fix that too.

  • @knucklessg1
    @knucklessg1 ปีที่แล้ว +1

    This is really a behind the scenes for Grand Theft Auto 6

  • @Voudoo1
    @Voudoo1 ปีที่แล้ว +3

    Facial recognition
    The dream tool of all governments....

  • @radioactiveag2531
    @radioactiveag2531 ปีที่แล้ว +1

    As I make characters in Character Creator.. lol this is amazing.

  • @SaintMatthieuSimard
    @SaintMatthieuSimard ปีที่แล้ว

    Awesome, where do I download it?

  • @bencrossley647
    @bencrossley647 ปีที่แล้ว +4

    Hi Karoly,
    I have a degree in maths and went into teaching. What would you recommend to someone looking to get into AI? I'm hoping to transition but have little coding experience.
    I find every video you produce fascinating and want to get on the other side of the curtain.
    Thank you for your efforts. Ben

    • @slangster42
      @slangster42 ปีที่แล้ว +4

      Unfortunately, I think that coding is required to work in AI-related fields. Personally, I'd recommend that you read a few technical books about AI, just to make sure that you really want to switch from teaching to AI, and to get some basics in programming. I think that "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron is a great start. Just be sure to have the second edition and not the first, because the first edition uses an outdated Python library. In any case, you'll also have to learn to code in Python, and there are probably a lot of books or tutorials online for that as well.

    • @bencrossley647
      @bencrossley647 ปีที่แล้ว +1

      @@slangster42 Lovely reply. Thank you.
      I've started teaching myself Python by doing the first 20-ish Project Euler problems.
      Thank you for the recommendation. I'll check it out :)

  • @Pandarah
    @Pandarah ปีที่แล้ว

    Imagine having to undergo surgery. Your body gets scanned, as well as your brain under different circumstances like joy, rest, pain, etc. Your full body scan, including all the interior tissue, gets fed into an AI-operated robot surgeon. It analyses all the data and figures out the best procedure for this surgery to be a success, with minimal incisions, movements, and risks. The AI then explains the whole procedure to the doctors involved, so they understand what it is going to do. It monitors your brain patterns to be sure you're actually under anesthesia during the procedure. Difficult procedures that once took many hours and involved huge risks could now be done in 45 minutes, with next to no risk involved. Doctors would still be in the room to intervene when necessary, as one of many failsafe systems, of course.