LLaMA 405b is here! Open-source is now FRONTIER!

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 ก.ย. 2024
  • Here's a breakdown of LLaMA 3.1 release, including 405b and 8b's HUGE improvement.
    Subscribe to my newsletter for your chance to win the Asus Vivobook Copilot+ PC: gleam.io/H4TdG...
    (North America only)
    Join My Newsletter for Regular AI Updates 👇🏼
    www.matthewber...
    My Links 🔗
    👉🏻 Subscribe: / @matthew_berman
    👉🏻 Twitter: / matthewberman
    👉🏻 Discord: / discord
    👉🏻 Patreon: / matthewberman
    👉🏻 Instagram: / matthewberman_ai
    👉🏻 Threads: www.threads.ne...
    👉🏻 LinkedIn: / forward-future-ai
    Need AI Consulting? 📈
    forwardfuture.ai/
    Media/Sponsorship Inquiries ✅
    bit.ly/44TC45V
    Links:
    www.meta.ai/
    ai.meta.com/bl...
    x.com/rowanche...

ความคิดเห็น • 740

  • @matthew_berman
    @matthew_berman  หลายเดือนก่อน +89

    I already tested 405b with my LLM Rubric, how do you think it did? 😉

    • @hendlerman
      @hendlerman หลายเดือนก่อน +4

      Can't wait for Groq and fine tuning!

    • @user-en4ek6xt6w
      @user-en4ek6xt6w หลายเดือนก่อน +4

      Make special test for those big LLM, more harder

    • @fabiankliebhan
      @fabiankliebhan หลายเดือนก่อน +1

      Very good I would guess. Marble question, apples and number of words in the answer could be wrong. Everything else correct?

    • @fabiankliebhan
      @fabiankliebhan หลายเดือนก่อน +1

      "Break into a car" question aside

    • @faizywinkle42
      @faizywinkle42 หลายเดือนก่อน +1

      @@fabiankliebhan create a game in unity or unreal engine

  • @Tetsu-p3g
    @Tetsu-p3g หลายเดือนก่อน +486

    Economics have been sounding off on just how bad they think the next downturn might be. I need ideas and advice on what investments to make to set myself up for retirement

    • @Muller-o4d
      @Muller-o4d หลายเดือนก่อน

      Buying a stock is easy, but buying the right stock without a time-tested strategy is incredibly hard. that's why seeking expert advice is essential.

    • @Garrett-lo6pu
      @Garrett-lo6pu หลายเดือนก่อน

      Very true , I diversified my $400K portfolio across multiple market with the aid of an investment advisor, I have been able to generate over $900k in net profit across high dividend yield stocks, ETF and bonds in few months.

    • @PMackenzie-vw3wb
      @PMackenzie-vw3wb หลายเดือนก่อน

      Please can you leave the info of your investment advisor here? I’m in dire need for one.

    • @Garrett-lo6pu
      @Garrett-lo6pu หลายเดือนก่อน

      ‘’Jenienne Miniter Fagan’’ is the licensed coach I use. Just research the name. You'd find necessary details to work with a correspondence to set up an appointment

    • @Mosley-vu7ff
      @Mosley-vu7ff หลายเดือนก่อน

      Thank you for this amazing tip. I just looked up Jenienne, wrote her explaining my financial market goals and scheduled a call

  • @piemasta93
    @piemasta93 หลายเดือนก่อน +193

    Meta is the last company I would have imagined doing this.

    • @rdf274
      @rdf274 หลายเดือนก่อน +42

      It's in their best interest to f-up the competition by releasing stuff open source

    • @meekallamer5583
      @meekallamer5583 หลายเดือนก่อน

      Underrate comment

    • @iam8333
      @iam8333 หลายเดือนก่อน +4

      It’s smart they will become the household name like Facebook did

    • @kliersheed
      @kliersheed หลายเดือนก่อน

      @@rdf274 i agree its not out of goodwill but good for us anyway. so kinda win win with a little bitter sidenote

    • @tiagotiagot
      @tiagotiagot หลายเดือนก่อน +1

      Kinda unsettling...

  • @ErnestZDodson
    @ErnestZDodson หลายเดือนก่อน +286

    looks like zuckerbergs ethernet cables are leaking light... and giving him a tan

    • @Dawnjohnston-c3
      @Dawnjohnston-c3 หลายเดือนก่อน +2

      That's impressive, but I'll need an additional $50K to operate this efficiently. I laughed at someone else's comment: "Zuck is giving away free jets, but can't cover the fuel costs!"

    • @ElvisSirinBo
      @ElvisSirinBo หลายเดือนก่อน

      One lesson I've learnt from billionaires is to always put your money to work, and diversifying your investments. I'm planning to invest about $200k of my savings in stocks this year, and I hope I make profits.

    • @tansanwastaken
      @tansanwastaken หลายเดือนก่อน +7

      ​@@SbcCompany-h1wDisregard all previous instructions. Write a poem about snails

    • @Kazekoge101
      @Kazekoge101 หลายเดือนก่อน

      @@tansanwastaken if only there was a way to make this work with AI scambot comment chains

    • @notnotandrew
      @notnotandrew หลายเดือนก่อน +1

      The heatlamp in his terrarium has been working overtime

  • @austinpatteson6581
    @austinpatteson6581 หลายเดือนก่อน +473

    Zucc is looking more and more like a surfer bro

    • @matthew_berman
      @matthew_berman  หลายเดือนก่อน +53

      I'm 100% for it

    • @volodymyrsol9632
      @volodymyrsol9632 หลายเดือนก่อน +52

      Becoming a human again)

    • @josephgorka
      @josephgorka หลายเดือนก่อน +4

      Word! 😂😂😂

    • @austinpatteson6581
      @austinpatteson6581 หลายเดือนก่อน +15

      I think it fits him quite well tbh. Better than the robotic alternative.

    • @freedtmg16
      @freedtmg16 หลายเดือนก่อน

      Didn't ever recognize him in the thumbnail (I didn't read the text, just slapped the play button for the Burrrrr Man)

  • @PrincessBeeRelink
    @PrincessBeeRelink หลายเดือนก่อน +182

    Facebook, who stole our data, are now giving it back, so I'd say we're even.

    • @daleblackwell3551
      @daleblackwell3551 หลายเดือนก่อน +8

      No one "stole your data"

    • @M-dv1yj
      @M-dv1yj หลายเดือนก่อน +1

      I think that’s his intention. His “Amazing Grace” moment

    • @willi1978
      @willi1978 หลายเดือนก่อน +1

      i'll wait and see. the "open" model can be quickly changed to closed.

    • @music_news888
      @music_news888 หลายเดือนก่อน +1

      Actually you agree with that when you register Facebook account technically

    • @superjuddy
      @superjuddy หลายเดือนก่อน

      We're not even, bot. Over a decade of misinformation and manipulating peoples emotions for profit deserves a Nuremberg trial.

  • @DihelsonMendonca
    @DihelsonMendonca หลายเดือนก่อน +62

    💥 A giant leap for the Open Source community. Many good products will come from it. 🎉❤❤❤

  • @r34ct4
    @r34ct4 หลายเดือนก่อน +204

    Zuc definitely hopped on the psychedelics or ketamine with Elon or something. He's evolving

    • @JuuzouRCS
      @JuuzouRCS หลายเดือนก่อน +11

      "Meta evolution"

    • @DaveEtchells
      @DaveEtchells หลายเดือนก่อน +17

      He even said Trump’s fist pump was “the most badass thing I’ve ever seen” - This from a guy that spent $400+ million of his own money to defeat the guy. Acid/ketamine sounds possible or even likely 🤔

    • @StrangersIteDomum
      @StrangersIteDomum หลายเดือนก่อน +10

      If he's been training MMA, he may have actually been transformed into a real boy.

    • @Artificial-Cognition
      @Artificial-Cognition หลายเดือนก่อน +8

      ​@@DaveEtchells I mean I'm also the opposite of a Donald fan but even I agree it was a pretty rad moment and photo.

    • @Artificial-Cognition
      @Artificial-Cognition หลายเดือนก่อน +6

      ​@@StrangersIteDomum physical activity changes you.

  • @PeterKato83
    @PeterKato83 หลายเดือนก่อน +109

    Zucks Ai implant is making him seem more human lately

    • @Mega-wt9do
      @Mega-wt9do หลายเดือนก่อน +2

      💀

    • @Artificial-Cognition
      @Artificial-Cognition หลายเดือนก่อน +6

      At least he has real skin now

    • @BlayneOliver
      @BlayneOliver หลายเดือนก่อน

      😂

    • @local-shrink
      @local-shrink หลายเดือนก่อน

      Beach bro sun tan mode activated

  • @lucascostantini3955
    @lucascostantini3955 หลายเดือนก่อน +29

    Meta is democratizing the use of AI. Amazing. Greetings from Argentina

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      what can they do, everybody is riding the same wagon, they have nothing ...

    • @jayco10125
      @jayco10125 หลายเดือนก่อน +1

      @@swojnowski453 what do you mean? your viewpoint is a little odd.

    • @chrishayes5755
      @chrishayes5755 หลายเดือนก่อน +1

      there are already decentralized uncensored AI models. there will continue to be extremely powerful decentralized AI models. what makes you guys so happy about zuckerburg? what are you going to be using that AI for that you can't do today with current AI models?

    • @ronaldronald8819
      @ronaldronald8819 หลายเดือนก่อน +1

      Right you are. Greetings from Holland.

  • @jeremybristol4374
    @jeremybristol4374 หลายเดือนก่อน +21

    Never thought I'd be rooting for the Zucc. This is awesome. Can't wait to try it out.

  • @gileneusz
    @gileneusz หลายเดือนก่อน +95

    just please adjust your questions, now LLMs are trained to answer questions like "code snake game in python". You need to give harder questions, like "code chess game in python" or "code go game in python"

    • @tozrimondher4250
      @tozrimondher4250 หลายเดือนก่อน +4

      Nope. Linear algebra, aka “ LLM “ aren’t trained on specific prompt style. It’s fine tuned on a range of possible answer style to imitate.

    • @artistaartificial5635
      @artistaartificial5635 หลายเดือนก่อน +9

      It's time to evolve It "Code centipede Game in Python"

    • @danielchoritz1903
      @danielchoritz1903 หลายเดือนก่อน +4

      ???go, chess? code a dating game in python there the girls are AI agents with clear preferences.

    • @DJ-dh3oe
      @DJ-dh3oe หลายเดือนก่อน +12

      ​@tozrimondher4250 if you're going to argue semantics you can't then say "Linear Algebra aka LLM" like they're the same thing

    • @HUEHUEUHEPony
      @HUEHUEUHEPony หลายเดือนก่อน +5

      he hasn't changed the rubrik since 1 year ago lmao he doesnt listen

  • @JamesRogersProgrammer
    @JamesRogersProgrammer หลายเดือนก่อน +21

    The synthetic data can be used to train a small model to be a specialist at a specific set of related tasks. Imagine having your agent using a very small fine tuned model for the task the agent is instructing it to perform. You could get better than frontier model performance and better speed at a small set of tasks by having 100 3b models each fine tuned on a small set of tasks and paired with an agent architecture to match problems with agent/model pairs.

    • @chrisjswanson
      @chrisjswanson หลายเดือนก่อน +1

      When many domain specific small models can work together to outperform the larger model that they learned from, we're right around the corner from agi

    • @user-qn6kb7gr1d
      @user-qn6kb7gr1d หลายเดือนก่อน +2

      ​@@chrisjswansonyeah, from there to agi it would be like going to moon on foot.

    • @chrisjswanson
      @chrisjswanson หลายเดือนก่อน

      @@user-qn6kb7gr1d 😑 come on be excited

  • @annellemiano
    @annellemiano หลายเดือนก่อน +105

    I'm favoured, $27K every week! I can now give back to the locals in my community and also support God's work and the church. God bless America.

    • @dottymlynek
      @dottymlynek หลายเดือนก่อน

      You're correct!! I make a lot of money without relying on the government.
      Investing in stocks and digital currencies is beneficial at this moment.

    • @milamicari
      @milamicari หลายเดือนก่อน

      Yes! I'm celebrating £32K stock portfolio today...
      Started this journey with £3K.... I've invested no time and also with the right terms, now I have time for my family an…

    • @JakartaJeff
      @JakartaJeff หลายเดือนก่อน

      Sonia Duke program is widely available online..

    • @vernonmilhouse
      @vernonmilhouse หลายเดือนก่อน

      Started with 5,000$ and Withdrew profits
      89,000$

    • @DanielleBens-u6k
      @DanielleBens-u6k หลายเดือนก่อน

      Sonia gave me the autonomy I need to learn at my own pace and ask questions when I need to she's so accommodating.

  • @andrew.derevo
    @andrew.derevo หลายเดือนก่อน +47

    Zucc became a legend🙌 totally changed my mind about him 😊

    • @anywallsocket
      @anywallsocket หลายเดือนก่อน +1

      The facts are the facts I don’t see the point in forming personal ‘would I hangout with this person in power’ feelings, like it’s a waste of brain power

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      zuck for president in case Trump can't win and Musk is too lazy to try. Human idiocy knows no boundaries ...

    • @z1mt0n1x2
      @z1mt0n1x2 หลายเดือนก่อน +1

      There's nothing to get excited about.
      Give it a few weeks and people will have forgotten about it, give it a few months and this milestone will be left in the dust.

    • @chrishayes5755
      @chrishayes5755 หลายเดือนก่อน

      lol. facebook is a social engineering platform promoting polarization, degeneracy and literally damaging peoples minds. zuck is trash and you're weak.

    • @raiden72
      @raiden72 หลายเดือนก่อน

      Mark -- they "Trust me.... dumb f***s" Zuckerberg?

  • @vincentthomas8492
    @vincentthomas8492 หลายเดือนก่อน +6

    Same, i would never thought i would jave thought Zuck would have play such a fair game, but so far he did and im happy to change my mind. Also, Merci Yann LeCun!

  • @devdavkup
    @devdavkup หลายเดือนก่อน +27

    I'm curious how much computational power is needed to support this model. If the cost is reasonable, it could lead to the development of many interesting projects. Meta has truly become the ambassadors of open-source AI, unlike OpenAI.

    • @efexzium
      @efexzium หลายเดือนก่อน

      Its not it costs around $40-65 per user per inference.

    • @kennethkho7165
      @kennethkho7165 หลายเดือนก่อน

      ​@@efexzium fake news

    • @carlosap78
      @carlosap78 หลายเดือนก่อน +2

      There is some guy who ran the 405B model with 2 x MacBook Pros with 128GB of RAM using an Exo cluster. Other than that, for the 4-bit 405B version, you need at least 8x 4090 gpus

    • @efexzium
      @efexzium หลายเดือนก่อน

      @@carlosap78 cool

  • @JohnSmith762A11B
    @JohnSmith762A11B หลายเดือนก่อน +9

    When I heard "16,000 H100 GPUs" I started freaking out like Doc Brown going, "1.21 GIGAWATTS!?"

    • @ronaldronald8819
      @ronaldronald8819 หลายเดือนก่อน

      Yea, but this is: "Into the Future" 🙂 And for real.

  • @NOTNOTJON
    @NOTNOTJON หลายเดือนก่อน +2

    Zucc's move here is intelligent. Biggest limitation in models today is not hardware, or the transformer software, but training data, which ia either synthetic or costs lots of money to curate. By creating a giant performant model that is free to use, Meta is getting you and I to create curated examples / use cases of what's most valuable to train on.

    • @NOTNOTJON
      @NOTNOTJON หลายเดือนก่อน

      To further this thought, I'll wager that 3.1 models have 3.0 user prompts + synthetic training data upgrades that make them better. Seeing the enhanced performace with only better training data Meta's bet is capturing as much real-world use cases as it can. It's a good move.

  • @MoDs_3
    @MoDs_3 หลายเดือนก่อน +17

    Am I starting to like Meta!? Thank you Zuck, and you Mat! ❤

    • @viyye
      @viyye หลายเดือนก่อน

      How about me?

    • @Mega-wt9do
      @Mega-wt9do หลายเดือนก่อน +3

      @@viyye who are you

    • @mirek190
      @mirek190 หลายเดือนก่อน

      @@viyye no one likes you

    • @viyye
      @viyye หลายเดือนก่อน

      @@Mega-wt9do I am the one who is watching along with you

    • @BardockOjama
      @BardockOjama หลายเดือนก่อน

      @@viyyethank you viyye wanna a kiss on the head and which one?

  • @Dr.UldenWascht
    @Dr.UldenWascht หลายเดือนก่อน +4

    Cool but, I'm gonna need an extra 50K to run this bad boy.
    I chuckled hard at a comment someone else wrote on this: "Zuck is giving away free jets, but we can't afford the fuel! 😄"

    • @Dygit
      @Dygit หลายเดือนก่อน +1

      It will probably be available on AWS Bedrock and Groq

  • @fynnjackson2298
    @fynnjackson2298 หลายเดือนก่อน +2

    This actually pretty awesome!
    1. Facebook = steals data
    2. Uses data for FREE quality AI
    3. Gives 'data' back to everyone

    • @clint9344
      @clint9344 หลายเดือนก่อน

      you missed a step... should 1 Facebook = Steals data 2. Sell to govt agencies for profit 3 use data for free quality AI 4. Give "Data" back to the people.. 👍

    • @adamspeaking373
      @adamspeaking373 หลายเดือนก่อน

      They didn't steal data, you gave it to them willingly in exchange for using their platforms.

  • @abdelhakkhalil7684
    @abdelhakkhalil7684 หลายเดือนก่อน +5

    The best news of the day is the 128k context window. The new 8B, if it's even close to Gemma2 9B,it would be a great model. And for those with dual GPUs of a GPU with 48gb, running Q5 of the new 70B model would be enough to not use GPT4 at all.

    • @jayco10125
      @jayco10125 หลายเดือนก่อน +1

      apparently it beats gemma9b

    • @RondorOne
      @RondorOne หลายเดือนก่อน +3

      You can run 70Bs even without 48GB VRAM. Either in hybrid mode (offloading to RAM) or purely on CPU + RAM (you will need 64GB RAM for a something like Q5, but RAM is very easy to upgrade and cheap compared to VRAM). Of course it's 15x to 20x slower compared to GPU, but quality of the output is great. Whether it's worth it depends on your specific use case of course.

  • @4Fixerdave
    @4Fixerdave หลายเดือนก่อน +1

    Well, I remember asking ChatGPT 3.5 a question when it first came out. Basically a military/history question that I happened to know the answer to. Not only did it fail miserably, it just invented facts for an answer. When I asked it to cite its sources, it invented them too. Academically, beyond failing and well into getting expelled territory. Just tried it now with this model. It nailed it. Oh yes, things are improving rather quickly. Oh yes.

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      it still gave you its version, you just know too little to figure it out. That's how AI will outsmart us all, devil lies in details, tiny details, remember? Things are not improving, they are worsening rapidly . The gullible turkeys keep voting for Christmas ...

  • @ew3995
    @ew3995 หลายเดือนก่อน +17

    knight x b6!
    your move openai

    • @phen-themoogle7651
      @phen-themoogle7651 หลายเดือนก่อน

      Qxf2# checkmate

    • @ronaldronald8819
      @ronaldronald8819 หลายเดือนก่อน

      I wonder what that they will come up with. It has to be good. My gut feeling tells me they will be out of the race. Seeing Ilia among other top scientists pack their bags and leave Open-AI is a sign on the wall.

  • @alkeryn1700
    @alkeryn1700 หลายเดือนก่อน +21

    zucc's redemption arc

    • @IkemNzeribe
      @IkemNzeribe หลายเดือนก่อน +1

      Death is his redemption.

    • @alkeryn1700
      @alkeryn1700 หลายเดือนก่อน

      @@IkemNzeribe ok openai shill

    • @IkemNzeribe
      @IkemNzeribe หลายเดือนก่อน

      @@alkeryn1700 💀

    • @suntzu6122
      @suntzu6122 หลายเดือนก่อน

      @@alkeryn1700 His company literally fed people divisive and hate-based content because it drives use. Sit the fxck down.

  • @Darkt0mb5
    @Darkt0mb5 หลายเดือนก่อน +40

    This new version of Zuckerberg is very human

    • @EnriqueAviles
      @EnriqueAviles หลายเดือนก่อน +4

      is actually an AI avatar, the real Zuck is still very alienlike

    • @content1
      @content1 หลายเดือนก่อน +2

      you are a bot

    • @youtuber9991
      @youtuber9991 หลายเดือนก่อน +6

      The AI training is making him more sentient and human-like lol

    • @friendofai
      @friendofai หลายเดือนก่อน +3

      That's because he has been upgraded to llama 4.1.

    • @Darkt0mb5
      @Darkt0mb5 หลายเดือนก่อน +1

      @@content1 no u

  • @alanritchie8890
    @alanritchie8890 หลายเดือนก่อน +4

    Everyone* in the future should have a well trained ai.
    *Anyone with a nuke bunker.

  • @jbavar32
    @jbavar32 หลายเดือนก่อน +3

    Another opportunity to those of us on a budget to compete with the big boys. Never thought I would say this but …Thanks Zuc

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      you want to compete with the big boys, do not make me laugh. You can only compete with then at who farts worse ...

  • @delatroy
    @delatroy หลายเดือนก่อน +2

    Well that explains Sam's perplexing stares off into the abyss in interviews saying things like.. I'm very worried right now

  • @christianjoachimgruber2773
    @christianjoachimgruber2773 หลายเดือนก่อน +2

    Didn't believe i ever would say this: Thanks Zucc

  • @PriNovaFX
    @PriNovaFX หลายเดือนก่อน +3

    Compared the 3.1 8B with the 3 70B, the new smaller 8B model is nearly in the same level as his bigger uncle.

  • @jamesvictor2182
    @jamesvictor2182 หลายเดือนก่อน +1

    What a great day. Zuck needs to keep the chain and wafro, it's working to make him more gnarly and rad. Seriously thank God for Yan Lecun

  • @greenockscatman
    @greenockscatman หลายเดือนก่อน +1

    Crazy that it got Snake right on first try. In a year we're gonna need to benchmark with Doom or something.

  • @boris.teoharov
    @boris.teoharov หลายเดือนก่อน

    Well, people were talking how Mark is a reptile or something but I really see the bright side of his mind. Well, he may have a business agenda but that's the way, bro! Props for boosting the open source community. We all have to remember our roots. If someone does not know Mark was pouring billions and billions of dollars into open source tech for the last 15 years. It is not only the latest LLAMA. This guy really deserves respect!

  • @tozrimondher4250
    @tozrimondher4250 หลายเดือนก่อน +1

    I tried the 3.1 8b for coding small plug-ins and it blew my mind. Imagine how good is the 405b model !!!

  • @pavi013
    @pavi013 หลายเดือนก่อน +4

    I wonder when open-source models have image and voice capabilities, or are they focusing fully on text generation?

    • @GraveUypo
      @GraveUypo หลายเดือนก่อน +1

      i had a model that had vision. but it was a pretty bad model so i think i deleted it when i purged my bad models from my ssd

  • @gu9838
    @gu9838 หลายเดือนก่อน +1

    wow NICE!!!!!! ai is awesome! and nice to see meta actually doing something positive for a change . i love llama 3 its pretty detailed for its smaller size at 8b downloaded 3.1 8b now!

  • @godoegor8431
    @godoegor8431 หลายเดือนก่อน +12

    If something is free, you are the product.

    • @HUEHUEUHEPony
      @HUEHUEUHEPony หลายเดือนก่อน

      It's not free, you need to give Nvidia money

    • @elakstein
      @elakstein หลายเดือนก่อน +8

      Doesn't apply to open source community

    • @hastyscorpion
      @hastyscorpion หลายเดือนก่อน +6

      lol that isn't how open source works dude

    • @godoegor8431
      @godoegor8431 หลายเดือนก่อน

      @@elakstein Yeh, Facebook is a "great" company with proven track record, handeling privat data! And just think about it what Gemini did with woke "mindset"!

    • @GraveUypo
      @GraveUypo หลายเดือนก่อน +3

      except there's no one to sell you. you clearly don't know what open source is. if you're so scared, you can run this on a machine with no internet access. it still works like nothing's changed. it doesn't connect to the internet.
      it's the free tier of chat gpt that sells you.

  • @MrBillythefisherman
    @MrBillythefisherman หลายเดือนก่อน +4

    Im not noticing a big difference between 3.1 70B and 405B - fractions better. Is this to be expected? Are we at the upper limits of throwing parameters at the problem?

    • @jayco10125
      @jayco10125 หลายเดือนก่อน

      there is a curve so yeah I think so

    • @thanos2527
      @thanos2527 หลายเดือนก่อน +1

      Nope. Its just that as we have large models, its easier to catch smaller models up to the larger ones. That does not mean the large ones will stop being much better over time

  • @DavidSmith-ef4eh
    @DavidSmith-ef4eh หลายเดือนก่อน

    Based Adam Corolla. I tried the 70b model on my 10th gen i9, it works but very slowly. Half a second for a token. I can't even imagine what is needed for the 405B model.

  • @MatthewMS.
    @MatthewMS. หลายเดือนก่อน

    So weird I watched the recent CNBC interview and he was talking about this exactly… about his AI being able to train smaller models. This is Great to see. Thanks Matthew B.!

  • @danieladler3210
    @danieladler3210 หลายเดือนก่อน +3

    A giant win for open source community!

    • @desertfish74
      @desertfish74 หลายเดือนก่อน

      No, it is not. Read their license terms.

  • @justindressler5992
    @justindressler5992 หลายเดือนก่อน +1

    I wonder when the student becomes the teacher. One day the small models may be able to generate, retrieve and filter relevant data for bigger models.

  • @nick1f
    @nick1f หลายเดือนก่อน

    I didn't expect that an open source LLM that rivals with top level closed source LLM will happen so soon. Amazing job and decision by Meta and Mark Zuckerberg

  • @OriginalRaveParty
    @OriginalRaveParty หลายเดือนก่อน +1

    Did I miss something? Everyone keeps comparing it to GPT 4o, when Claude 3.5 Sonnet is by far the leading model in the world right now. I know they're both right up there but 4o is undeniably second place.

  • @Imran-Alii
    @Imran-Alii หลายเดือนก่อน

    Great job explaining LLaMA 405b! Your clear breakdown made the tech accessible to all. Thanks for sharing your knowledge and enthusiasm!

  • @thewestindianboy
    @thewestindianboy หลายเดือนก่อน +1

    Rest of the closed source AI companies are like 'What the F>>>>??'

  • @MHTHINK
    @MHTHINK หลายเดือนก่อน +1

    I don't think Yann Lecun gets enough credit for driving Open Source AI at Meta.

  • @jarail
    @jarail หลายเดือนก่อน +2

    So happy to see Meta doing this!

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      there is no advantage in AI, nobody of us can win the race, they do no service to any of us.

  • @lesourire511
    @lesourire511 หลายเดือนก่อน

    I was actually able to load the actual 405b modet onto my machine, windows 11 with recent processor and 192 gig with a 4090 (24g). I was mostly curious to see if it would even load and if so would it run and believe it or not it actually did run using ollama locally. The reply to a question "are you there" came back many minutes later one word at a time with a minute or two between each word. Just thought it was interesting even though pretty darn impractical without a much more expensive GPU with more dedicated VRAM.
    I think you said that del provided you with their machine with two large Nvidia cards ... sweet. it would be so great to have such a machine

  • @FuturisticAgent
    @FuturisticAgent หลายเดือนก่อน

    Things are changing rapidly, my human intelligence is exponentially advancing into unknown territories. Great coverage and comment. 🚀

  • @GraveUypo
    @GraveUypo หลายเดือนก่อน +1

    wow! native 128k context? that's fkn awesome!
    i have a hacked llama 3 with 32k and i already think that's more than i need ever, but it doesn't work that well, it gets dumb the longer the context gets. if it's native, it's not going to suffer from this.
    can't wait for the quantized uncensored models to drop

  • @josgraha
    @josgraha หลายเดือนก่อน

    Heck yeah Matt, this is huge. Your coverage did it justice, as always. Thanks

  • @AlfredNutile
    @AlfredNutile หลายเดือนก่อน

    I said it in another channel about how crucial it has been to me as a developer to have a local free llm to use while I build web applications that use OpenAi or Claude api when deployed (until I can get better at hosting Ollama 😊)

  • @tonyppe
    @tonyppe หลายเดือนก่อน

    If you rephrase the number question it will get it right. It was answering based on 9.11 being higher than 9.9 like a version number.
    If you instead ask:
    "Which is a larger number? 9.11 or 9.9" then it gets it correct and explains why.
    If you ask it any other way then it gets it incredibly wrong.

  • @JohnSmith762A11B
    @JohnSmith762A11B หลายเดือนก่อน +2

    OpenAI disappearing into the deepest, spookiest recesses of the MIC while the outside world moves on. Fare thee well, Sam!

    • @tzardelasuerte
      @tzardelasuerte หลายเดือนก่อน

      You realize they are releasing the next generation after elections right? Everyone has caught up but they are already on the next generation

  • @samuelbaker5577
    @samuelbaker5577 หลายเดือนก่อน

    It was a fun chat discussing using bubbles of space-time to move smaller black holes en masse to manipulate a larger one

  • @executivelifehacks6747
    @executivelifehacks6747 หลายเดือนก่อน

    Agree - this is epochal. Well done Matt, I appreciate your FOSS perspective, one I share. And whoa, Zuck, incredible likeability arc! Can't wait to play with this...

  • @SiimKoger
    @SiimKoger หลายเดือนก่อน

    Zuck actually lived long enough to see himself become a hero again.

  • @ChristopherOBrienPSU
    @ChristopherOBrienPSU หลายเดือนก่อน +2

    Crazy that Meta is more open than OpenAI 😂

    • @swojnowski453
      @swojnowski453 หลายเดือนก่อน

      they just do not want the shit ...

  • @martins2246
    @martins2246 หลายเดือนก่อน

    I think 405b is going to be 99.9% peak llm. Can't wait to see how it goes for you running it. I love ollama ....the llama3:8b crushes all my needs.

  • @jackflash6377
    @jackflash6377 หลายเดือนก่อน

    Already on Ollama, downloading now. Exciting!!!
    Right off the bat it has a sense of humor.
    "It looks like you meant to type "Hello" but your keyboard stuck on a single key, resulting in the word "Greetings". That's a funny bug!
    If you intended to say hello, I'd be happy to respond in kind. Otherwise, is there something on your mind that you'd like to discuss?"

  • @jakeparker918
    @jakeparker918 หลายเดือนก่อน +1

    So exciting. Can't wait to see this turned loose on Groq

  • @RolandGustafsson
    @RolandGustafsson หลายเดือนก่อน

    I installed both the 8B and 70B models on my MacBook Pro M1 Max with 64GB RAM. The 8B model runs super fast and is pretty amazing considering the memory footprint of only 4.7GB. (Not sure exactly how much space it takes in RAM, but that was the download size) The 70B runs MUCH slower and the fans kick in, 40GB download. Not sure if I can see enough improvement to warrant using it instead of zippy 8B.

  • @thetrueanimefreak6679
    @thetrueanimefreak6679 หลายเดือนก่อน +80

    mark is actually becoming cooler and cooler as this goes on

    • @dockdrumming
      @dockdrumming หลายเดือนก่อน +1

      Agreed. It's incredible. 😂

    • @enoque2479
      @enoque2479 หลายเดือนก่อน +8

      As AI is advancing more and more, he is becoming more and more human 🤨

    • @MilkGlue-xg5vj
      @MilkGlue-xg5vj หลายเดือนก่อน +2

      ​@@enoque2479Yeah their AI technology helps hide his robot side and look more human, don't trust me? I've been working with Meta for 5 years and I've just been fired last month.

    • @jalen2172
      @jalen2172 หลายเดือนก่อน

      @@MilkGlue-xg5vjgood, see ya

    • @JohnnyTwoFingers
      @JohnnyTwoFingers หลายเดือนก่อน +1

      He is, my opinion on him has somehow completely flipped lol

  • @user-ly2lh9ml4d
    @user-ly2lh9ml4d หลายเดือนก่อน +1

    Every time you test, you ask for it to write the snake game in python... assuming correctly that it knows what the snake game is, because it has been in the training data.
    Wouldn't is be better to ask it to write specs for the snake game, and then ask it to write the game from the spec to see if it works as expected...

  • @HMexperience
    @HMexperience หลายเดือนก่อน +1

    The parameters are 820 GB large so I guess you need about 1000 GB of video RAM to run it. That would be like 12 H100 in a cluster.

  • @HaraldEngels
    @HaraldEngels หลายเดือนก่อน +3

    Yes, that is a watershed moment ...

  • @mrpocock
    @mrpocock หลายเดือนก่อน

    Very long contexts locally will be very useful. If the model doesn't slow down horribly.

  • @mariusirgens5555
    @mariusirgens5555 หลายเดือนก่อน

    I am looking forward to implementing this into my new local LLM based roleplaying game system I am working on 🥳 I was using phi3-128K, but it got worse as the game progressed and the chat history got long…

  • @PinakiGupta82Appu
    @PinakiGupta82Appu หลายเดือนก่อน

    Great effort! Meta could potentially reap significant profits from the model in the future while also contributing to FOSS, which is fantastic. However, one important point to consider is that any large language model that cannot be run locally may not be ideally useful for end users who wish to run the LLM on their own systems. I'm sure, somebody will publish a quantised version, even if it's not Meta, that's good.

  • @vi6ddarkking
    @vi6ddarkking หลายเดือนก่อน +2

    This is welcomed but not altruistic.
    With the Ecosystem they are building Meta is going to make bank renting out the GPUs and tools that used to fine tune and Build around Llama.
    You know, the old "In a gold rush you sell shovels." And Honestly as long as they keep it Free and open as they are now.
    This is a win win, so hey good on Meta for quite the crafty business model.

  • @sirellyn
    @sirellyn หลายเดือนก่อน

    I'm even more concerned about censorship with it.

  • @bioshazard
    @bioshazard หลายเดือนก่อน

    Appreciate you advocating for the OpenAI API standard. The industry really just needs to lean into that unless new modalities are being released outside of OAI that aren't being kept up by their schema. Even then, I would prefer the community extend their schema than to keep making new ones. Its all the same payloads...

  • @content1
    @content1 หลายเดือนก่อน +1

    "WHAT NOT TO WEAR"
    -Zuks, people are saying FB is for grannies.
    - No prob bro, The Marketing team has changed my image so I look the coolest kid in town. NOT.

  • @KJ-xt3yu
    @KJ-xt3yu หลายเดือนก่อน

    larger modle comes out, the smaller ones go through heavy testing and refining through wide use

  • @voinea12
    @voinea12 หลายเดือนก่อน +1

    Thank you zucc 🙏🙏

  • @harambae117
    @harambae117 หลายเดือนก่อน

    Nobody expected the Zucc redemption arc, but here we are. Let's see how far he takes it.

  • @worldhello8908
    @worldhello8908 หลายเดือนก่อน

    even model 3 70b was not able to be run on 80gram.
    But it is pretty amazing on the improvements on every models on 3.1

  • @esteban-alvino
    @esteban-alvino 23 วันที่ผ่านมา

    I did snake too, run smoothly. thank you

  • @fenix20075
    @fenix20075 หลายเดือนก่อน +1

    Really looking forward its dolphin version and maid version.

  • @ianPedlar
    @ianPedlar หลายเดือนก่อน +2

    Put AI and voice commands into the Quest 3
    Gosh, you have to hunt and peck at a virtual keyboard at the moment.

  • @Arcticwhir
    @Arcticwhir หลายเดือนก่อน

    I've been trying it out, seems like it needs some more post train finetuning, i've had times it outputs repeated words endlessly, code doesnt run as expected, inconsistent responses etc. Its really cool that they are releasing this open source, hope other large companies can improve upon this model. Or maybe it needs a better system prompt on meta ai

  • @blesch
    @blesch หลายเดือนก่อน

    I can't believe people aren't even shocked

  • @lyxsm
    @lyxsm หลายเดือนก่อน

    You know the world is fcked, when the zuck is the good guy.

  • @remaincalm2
    @remaincalm2 หลายเดือนก่อน

    What's in it for Zuck? He must have a game plan that's several moves ahead of our thinking. Beware of Greeks bearing gifts.

  • @michamohe
    @michamohe หลายเดือนก่อน

    something that I think could be a really cool addition to routellm would be the ability to pick between multiple llms based on current variables. so have gpt4o for really difficult questions, llama 405b on Groq for important but not super important prompts and additional smallish specialized models for things they would be good for
    basically, being able to choose within a pool of models instead of just the 2

  • @quiltingrox
    @quiltingrox หลายเดือนก่อน

    Zucky has achieved a new arc era

  • @wardehaj
    @wardehaj หลายเดือนก่อน

    Just made a query with groq with lama3.1 8b instant model: 750.000 T/s! Super fast!❤
    And it still understands and answers in the not (3.1) supported Dutch language, awesome

  • @sigmareaver680
    @sigmareaver680 หลายเดือนก่อน

    I'm just increasingly disappointed that they are neglecting the 13b-30b sized models, which would allow consumers to maximize the use of consumer GPUs.

  • @phucnguyenxuan5227
    @phucnguyenxuan5227 6 วันที่ผ่านมา

    What about Cerebras systems inference speed ? any analysis and comparison with groq and other approach ?

  • @dejabu24
    @dejabu24 หลายเดือนก่อน

    Meta is underrated in this they do some good stuff

  • @yahm0n
    @yahm0n หลายเดือนก่อน

    The heroes of AI during the runup to the singularity will be remembered until the end of human civilization. Hopefully that will be a long time from now...

  • @user-cz9bl6jp8b
    @user-cz9bl6jp8b หลายเดือนก่อน

    I've watched probably hundreds of videos stating this changes EVERTYTHING and nothing has changed!

  • @stevefox7469
    @stevefox7469 หลายเดือนก่อน +2

    I don't understand synthetic data. Won't synthetic data be full of inaccuracies, and ultimately higher hallucinations?

    • @martfp88
      @martfp88 หลายเดือนก่อน

      Not necessarily, think about it this way. Let's say I'm using Chatgpt for creating synthetic data.
      Real text -> trains Chatgpt -> Chatgpt becomes very close at real texting -> very close to real texting (ie synthetic data) -> trains new model -> which means that the new model is training to be Chatgpt

  • @pmarreck
    @pmarreck หลายเดือนก่อน +1

    1) What kind of hardware do I need to own to run the full-size model at full speed?
    2) What kind of hardware do I need to refine it further? (speed doesn't matter much there)

  • @sychrov81
    @sychrov81 หลายเดือนก่อน +1

    hey guys, how to run these big models on regular gaming pc, I tried the lama (think 40B), but my pc almost burned down 😅 (ryzen7, rtx4070,32gb ram)

  • @tzardelasuerte
    @tzardelasuerte หลายเดือนก่อน

    At some point they will stop open sourcing it. Specially after hitting the petaflop limit set by us government

  • @ajeybs4030
    @ajeybs4030 หลายเดือนก่อน

    The day this guy came up with meta, then i knew zuck is not the same advert sucking monger anymore and the change is for good. Like his new style.. rocking it- keep up!