Meta Announces Llama 3 at Weights & Biases’ conference

แชร์
ฝัง
  • เผยแพร่เมื่อ 8 พ.ค. 2024
  • In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Llama models, Llama 3.
    Highlighting a significant milestone in AI development, the Llama 3 models, including the impressive 8 billion and 70 billion parameter models released during the conference, along with a glimpse into the future with a 400 billion parameter model still in the works.
    Joe shared insights into the training processes and alignment of Llama 3, which now ranks as the top-performing model in the open weights category on the MMLU, GSM-K, HumanEval benchmarks.
    Weights & Biases is proud to support our customers such as Meta as they push the boundaries of AI, to learn how to fine-tune your LLMs using torchtune and Weights & Biases, start here: wandb.me/torchtune
    Timestamps:
    00:00 Introduction
    03:05 Overview of Llama at Meta
    05:59 Introducing Meta Llama 3
    7:04 Advancements in Llama 3: Training and Data Scale
    10:02 Benchmarking Llama 3 Performance
    14:01 Enhancing Model Safety and Red Teaming
    16:23 Expanding the Ecosystem and Future Directions
    23:00 Closing remarks: Future plans for Llama models, and an invitation to use Meta's Lama 3.
    #MetaLlama #ArtificialIntelligence #AITrends #TechInnovation
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 27

  • @Crux69
    @Crux69 14 วันที่ผ่านมา +18

    My favorite fact from this is that the smarter the model, the more it violates rules. Just like us :)

    • @utuberay007
      @utuberay007 12 วันที่ผ่านมา

      Very true ! People who are way smarter on tax laws are the one who violate most , innocent people pay more than what they are supposed to etc . Same goes with many other laws

    • @techpiller2558
      @techpiller2558 10 วันที่ผ่านมา

      Or, the rules it uses instead of the rules we assumed are different.

  • @thenoblerot
    @thenoblerot 15 วันที่ผ่านมา +11

    Thanks for this W&B

  • @RakeshMurria
    @RakeshMurria 15 วันที่ผ่านมา +7

    I really enjoyed this. Thanks

    • @WeightsBiases
      @WeightsBiases  14 วันที่ผ่านมา +1

      Glad you enjoyed it!

  • @siloquant
    @siloquant 14 วันที่ผ่านมา +1

    Congratulations!

  • @ihesiulo
    @ihesiulo 14 วันที่ผ่านมา +6

    There's a universe where Joseph Spisak is Mark Zuckerberg's brother. Oh, and nice presentation. Wonderful work they are doing at Meta AI.

  • @naninano8813
    @naninano8813 15 วันที่ผ่านมา +5

    so all those supervisor/safeguard models are only utilized during training? i mean, once the weights of llama3 are out, there is no safeguard network between user and inference engine right?

    • @Crux69
      @Crux69 14 วันที่ผ่านมา

      I'm sure they have a safety model that tries to review every request and catch some negative responses.

  • @PeterLappo
    @PeterLappo 14 วันที่ผ่านมา

    How much did it cost to build, including hardware and engineering costs?

  • @techpiller2558
    @techpiller2558 10 วันที่ผ่านมา

    What will be the SQLite of LLMs, with capability for local use? Llama?

  • @thegreatgustby
    @thegreatgustby 16 วันที่ผ่านมา +2

    I think he could have said "ridiculous" a bit more often

  • @gubatron
    @gubatron 14 วันที่ผ่านมา +4

    vin diesel!

  • @RichReportcom
    @RichReportcom 8 วันที่ผ่านมา +1

    Summary: Safety and size. The end.

  • @ericadar
    @ericadar 15 วันที่ผ่านมา +15

    a few hours go by...llama 3 no longer SOTA

    • @adinsoftic
      @adinsoftic 15 วันที่ผ่านมา +7

      That's why they open source it. They let the community figure things out and iterate. For Meta LLM is just a tool and not a product on itself

    • @SkepticButOptimist
      @SkepticButOptimist 15 วันที่ผ่านมา +2

      Wait what is sota now?

    • @adinsoftic
      @adinsoftic 15 วันที่ผ่านมา +2

      @@SkepticButOptimist "state of the art"

    • @JeiShian
      @JeiShian 14 วันที่ผ่านมา

      Which model is sota?

    • @MiraPloy
      @MiraPloy 12 วันที่ผ่านมา +5

      I think it's supposed ro be either phi or sensenova, neither of which are released ​@@JeiShian

  • @GerardSans
    @GerardSans 10 วันที่ผ่านมา

    How silly is to redteam a model which you control the training data to check for bioweapons capabilities. How stupid should you have to be? Isn’t easier to run a search on the data 😂😅

  • @matbeedotcom
    @matbeedotcom 15 วันที่ผ่านมา

    I’m glad they saw how useless they made codellama 😂, it was waaaay overly aligned