Unveiling Meta's Impressive CV Model: Sam 2

แชร์
ฝัง
  • เผยแพร่เมื่อ 21 ต.ค. 2024

ความคิดเห็น • 77

  • @randotkatsenko5157
    @randotkatsenko5157 2 หลายเดือนก่อน +34

    Ive developed a fully automatic video editor fort short form content. It would read the frames of the video, images, transcript and apply various effects based on simple text instructions.
    THIS Meta's new model is THE missing piece to enable editing more complex scenarios in full TH-cam-length videos. Impressive indeed and thank you for sharing this prompty after the release, Sam!

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +3

      Very cool use and makes total sense. These models are getting so good we are bound mostly by our imagination of how to use them.

    • @picklenickil
      @picklenickil 2 หลายเดือนก่อน

      @@randotkatsenko5157 you have peaked my intrigue.. fellow nerd..

    • @navroopsingh8902
      @navroopsingh8902 2 หลายเดือนก่อน

      I had the same idea few months back. But had mercy on my laptop.

    • @navroopsingh8902
      @navroopsingh8902 2 หลายเดือนก่อน

      Can you share the repo link if its an open source project?

    • @amandamate9117
      @amandamate9117 2 หลายเดือนก่อน

      yeah but can you use this SAM2 locally(what hardware you need) and how would you implement in your workflow?

  • @nemonomen3340
    @nemonomen3340 2 หลายเดือนก่อน +6

    Cool to see Meta is releasing a new and improved version of Sam Altman.

  • @thenoblerot
    @thenoblerot 2 หลายเดือนก่อน +2

    Love the channel. I appreciate content that's not just all LLMs, all the time

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +2

      Thanks for the feedback, I will probably broaden a the coverage going forward.

  • @TailorJohnson-l5y
    @TailorJohnson-l5y 2 หลายเดือนก่อน +1

    Never been a fan of Zuck. But that's all changing now. What hes doing by *truly* open sourcing everything is game changing for humanity. Thanks Sam! Keep it going Zuckerburg!!!

  • @tecnom7133
    @tecnom7133 2 หลายเดือนก่อน +1

    Thanks for sharing, waiting for a Project using this Model !

  • @picklenickil
    @picklenickil 2 หลายเดือนก่อน +24

    Sam talking about Sam!
    Multiverse of Sam😂

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +4

      lol yes a well named model! 😂

    • @starblaiz1986
      @starblaiz1986 2 หลายเดือนก่อน +1

      I heard you like Sam, so we got Sam to tell you about Sam so that you can put a Sam in your Sam 😂

    • @saipraneethdevunuri3156
      @saipraneethdevunuri3156 2 หลายเดือนก่อน

      You've beat me to it

    • @flyingapple7119
      @flyingapple7119 2 หลายเดือนก่อน

      It's very Meta

  • @MrDanINSANE
    @MrDanINSANE 2 หลายเดือนก่อน +1

    Is there a friendly user interface project such as Gradio interface to test SAM2 with video locally? (demo is there of course) but Locally will be nice to test with different hardware since SAM2 supposed to be faster.

  • @toadlguy
    @toadlguy 2 หลายเดือนก่อน

    This model, released in this way, will lead to so many interesting new applications. It would seem that it's use in sports and fitness analysis could be impressive. And even something like traffic analysis, which is currently being done by expensive systems can be done with a consumer camera and open source software. Kudo's to Meta, Mark, and, of course, the OG Sam for letting us all know about it 😁.

  • @paulmiller591
    @paulmiller591 2 หลายเดือนก่อน

    Oh no I will certainly need to add this to my ever-expanding list of AI investigations. Yes this is undoubtedly worthy cheers Sam.

  • @DanieleCorradetti-hn9nm
    @DanieleCorradetti-hn9nm 2 หลายเดือนก่อน +17

    You should start covering vision models as you do LLMs🎉❤

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +9

      Thanks for the feedback, was wondering if there was interest.

    • @mshonle
      @mshonle 2 หลายเดือนก่อน

      I think the concept of transfer learning is really successful for vision models and hence vision models are a great way to explain this larger concept.

    • @dheerajvarma9527
      @dheerajvarma9527 2 หลายเดือนก่อน +1

      Please cover industrial usecases using CV models as well

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน

      Curious are there any techniques you want in particular?

    • @DanieleCorradetti-hn9nm
      @DanieleCorradetti-hn9nm 2 หลายเดือนก่อน

      ​@@samwitteveenaicrack detections e segmentation? Is a kind of standard

  • @unclecode
    @unclecode 2 หลายเดือนก่อน +24

    Wow! What’s left for OpenAI? How can they still be valued at $70 billion? Again Meta released another large model that allows people to generate synthetic data. I think the moat wasn't the LLM was becoming the one democratise it for everyone and Meta did that!

  • @FredPauling
    @FredPauling 2 หลายเดือนก่อน +9

    More exciting than Llama 405b

  • @ahmadzaimhilmi
    @ahmadzaimhilmi 2 หลายเดือนก่อน +2

    Those who aren't following AI news are missing out on all the advancements in this field.

  • @henkhbit5748
    @henkhbit5748 2 หลายเดือนก่อน

    Impressive indeed. 👍 for Meta for open sourcing the model. I suppose Marc like your name and did a rocket launch😄

  • @NicolasEmbleton
    @NicolasEmbleton 2 หลายเดือนก่อน

    This is sick 😮. Thanks for sharing. Missed that.

  • @ricosrealm
    @ricosrealm 2 หลายเดือนก่อน

    Meta is killing it right now with fully open models.

  • @NakedSageAstrology
    @NakedSageAstrology 2 หลายเดือนก่อน +1

    Good video. Thank you for sharing.

  • @v1nigra3
    @v1nigra3 27 วันที่ผ่านมา

    This is so next level

  • @nullsmack
    @nullsmack 2 หลายเดือนก่อน

    is this something that can be run locally on custom video?

  • @sambarjunk
    @sambarjunk 2 หลายเดือนก่อน

    Great walkthrough

  • @SonGoku-pc7jl
    @SonGoku-pc7jl 2 หลายเดือนก่อน

    10gb vram? is posible divide 16 ram with 4 vram nvidia 1070gtx? or cuantatizate uint8 or int4 i don't remember exactly word for presicion :P and you can make example of data from sam2 and make something with florence :)

  • @nufh
    @nufh 2 หลายเดือนก่อน

    Can we run this in normal computer or need a high end one?

  • @albertsitoe7340
    @albertsitoe7340 2 หลายเดือนก่อน

    Should run on the Apple Vision Pro with its M2?

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน

      probably but will need to be converted

  • @florentromanet5439
    @florentromanet5439 2 หลายเดือนก่อน

    Great analysis thanks 👍🙏

  • @IdPreferNot1
    @IdPreferNot1 2 หลายเดือนก่อน

    Access to demo denied? Can you please provide uopdated link to code, thx

  • @NakedSageAstrology
    @NakedSageAstrology 2 หลายเดือนก่อน +1

    Her name is Samantha. Tomorrow is July 31st.

  • @WillJohnston-wg9ew
    @WillJohnston-wg9ew 2 หลายเดือนก่อน

    how do you think this would do for real-time sentiment analysis on a face?

    • @pladselsker8340
      @pladselsker8340 2 หลายเดือนก่อน

      There are many really good face tracking and preprocessing algorithms out there. SAM2 would only be able to do the "tracking" part. You would still need to do further processing to infer emotions.
      You could maybe replace the first and last few layers of the architecture with a custom one, freeze the middle parameters, and train on custom data. This last approach would probably give better results.

  • @sondoan3070
    @sondoan3070 2 หลายเดือนก่อน

    🎯 Key points for quick navigation:
    🆕 Meta released the SAM 2 model, enhancing computer vision capabilities with real-time video processing.
    🎥 SAM 2 supports video analysis, capable of processing up to 44 frames per second for real-time segmentation and tracking.
    🛠️ The model allows segmentation with prompts and has been simplified for easier use, now integrating temporal memory.
    🚀 SAM 2 is six times faster than its predecessor and offers improved accuracy and efficiency for data annotation.
    📈 Meta has released SAM 2 with open-source code and weights under the Apache 2 license, promoting broader accessibility.
    📊 A dataset of 51,000 videos and over 600,000 masklets accompanies SAM 2, aiding in the development of custom models.
    🎨 The model can be used for various effects and applications, including real-time video effects and creative annotations.
    💻 Example notebooks provided demonstrate how to use SAM 2 for accurate segmentation and tracking in both images and videos.
    Made with HARPA AI

  • @marioignacio3440
    @marioignacio3440 2 หลายเดือนก่อน

    Is it ready to use now? Im in Asia right now..

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน

      yes you should be able to use it anywhere, it is an open weights model

  • @rohaidyrodriguez7179
    @rohaidyrodriguez7179 2 หลายเดือนก่อน

    Great for AR

  • @minhazrahman7085
    @minhazrahman7085 2 หลายเดือนก่อน

    Sam, help me on an approach. I want to use the RAG for different type of task. Rather than making knowledgebase, I want it to able to differentiate documents as different reports, and compare-contrsat, search across docs type of stuff. What type of pipeline should I follow? All the Rag models are mostly build to make knowledgebase, not compare, a lot of documents with precise output.

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +2

      you can use meta data to keep their identities separate etc as reports. What else do you want to do with them? You can use query rewriting to get and compare info between the reports etc.

    • @KevinKreger
      @KevinKreger 2 หลายเดือนก่อน +1

      Ask it to rank or rate instead of compare, if possible. You'll get better results.

  • @imevano21
    @imevano21 2 หลายเดือนก่อน

    Does anyone else have trouble installing sam2. I have cuda 12.4 and set my env variable but I'm getting:
    raise OSError('CUDA_HOME environment variable is not set. '
    OSError: CUDA_HOME environment variable is not set. Please set it to your CUDA install root.
    Would really appreciate any help

    • @rahulprajapati941
      @rahulprajapati941 2 หลายเดือนก่อน

      You'd need to add CUDA and cuDNN paths to the environment variables on your Windows

  • @starblaiz1986
    @starblaiz1986 2 หลายเดือนก่อน

    Meta beating up OpenAI:
    Everyone: "Pleaaaaaase stoooop! He's already unalive!" 😂

  • @ZIYUNI-h5k
    @ZIYUNI-h5k 2 หลายเดือนก่อน

    thx for sharing🎉

  • @MultiverseMayhemtoyou
    @MultiverseMayhemtoyou 2 หลายเดือนก่อน

    wow AI Apache helicopter ! ?

  • @frazuppi4897
    @frazuppi4897 2 หลายเดือนก่อน

    I mean the model is generating segmentation mask so it is generative ai

  • @Psychopatz
    @Psychopatz 2 หลายเดือนก่อน

    I hope the gimp team can utilize this

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน

      yeah you could imagine it could do some nice things in that app

  • @florentromanet5439
    @florentromanet5439 2 หลายเดือนก่อน +1

    Meta releases it's 2024 second quarter results tomorrow. Sounds like they spent a "lot" of money 💰 into 405b and SAM2 training... Open-source licensing all this is maybe their way to balance this 😅

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน +1

      Yeah it would be interesting if the broke out the costs of these models.

  • @fontenbleau
    @fontenbleau 2 หลายเดือนก่อน +2

    Police are very interested in this, considering masses of video material they have from public places, laws are very bad about this esp in USA with cellphones wiretapping by Stinger & other systems.

    • @samwitteveenai
      @samwitteveenai  2 หลายเดือนก่อน

      This is certainly true!

    • @toadlguy
      @toadlguy 2 หลายเดือนก่อน

      The police already have technology similar to SAM 2, now this will be available to all of us. 😉

  • @erobusblack4856
    @erobusblack4856 2 หลายเดือนก่อน +1

    i tried but it wasn't compatible with my beastly phone 😂

    • @erobusblack4856
      @erobusblack4856 2 หลายเดือนก่อน +1

      my android software experience is perfectly customized

  • @gokulakrishnanm
    @gokulakrishnanm 2 หลายเดือนก่อน

    Meta slaughtering OpenAI😂

  • @tonyppe
    @tonyppe 2 หลายเดือนก่อน

    Sam a.i. 😁

  • @MrNobodyX3
    @MrNobodyX3 2 หลายเดือนก่อน

    I don't like the fact that you're basically marketing this and not reviewing it you're not pointing out the glaring issues that you see in the video like the ball not being tracked or the eyeball at the Byrd flickering in and out

    • @v1nigra3
      @v1nigra3 27 วันที่ผ่านมา

      Because that’s easy to fix silly

  • @Csepowertrip123
    @Csepowertrip123 2 หลายเดือนก่อน

    Jab at open AI look what they named it lol