DeepSeek V3 Thorough Testing - Better Than GPT-4o and Claude Sonnet in Price and Performance

แชร์
ฝัง
  • เผยแพร่เมื่อ 27 ธ.ค. 2024

ความคิดเห็น • 19

  • @blossom_rx
    @blossom_rx 13 ชั่วโมงที่ผ่านมา

    Fahd, your channel has become one of my most liked. I really love that you do profound research and actually show how to use the tools. Unfortunately, but fortunately for you, not many people do that on TH-cam. Please keep up the good work!

  • @vil3nxd
    @vil3nxd 15 ชั่วโมงที่ผ่านมา +1

    sir I have been a follower for a few months and the knowledge you impart us is invaluable.
    I have a question how can you make videos this quick and deploy them?

  • @LoretoTomika
    @LoretoTomika 2 ชั่วโมงที่ผ่านมา

    Appreciate the detailed breakdown! A bit off-topic, but I wanted to ask: I have a SafePal wallet with USDT, and I have the seed phrase. (alarm fetch churn bridge exercise tape speak race clerk couch crater letter). What's the best way to send them to Binance?

  • @robertfairbrother4736
    @robertfairbrother4736 4 ชั่วโมงที่ผ่านมา

    Hi Fahd, which self-hosted AI model would you recommend to run on modest hardware resources and still produce reasonable output? Thank you for your simple, clear, and brief videos :-)

  • @thingX1x
    @thingX1x 5 ชั่วโมงที่ผ่านมา

    I love your channel lol. I started using your code language recommendations, Qwen is amazing,. Deepseek v3 is next level.

  • @john_blues
    @john_blues 17 ชั่วโมงที่ผ่านมา

    Why are they benchmarking it against Llama3.1 and not Llama 3.2 or 3.3?

    • @fahdmirza
      @fahdmirza  17 ชั่วโมงที่ผ่านมา

      That's a valid question.

  • @teetanrobotics5363
    @teetanrobotics5363 วันที่ผ่านมา +1

    Can you please show i can use it with cline

    • @Kaalkian
      @Kaalkian 14 ชั่วโมงที่ผ่านมา

      aicodeking

  • @simongentry
    @simongentry 17 ชั่วโมงที่ผ่านมา

    let's have a vote... conda vs pip. :)

  • @fontenbleau
    @fontenbleau วันที่ผ่านมา

    Their last coding model was amazing, the only one which capable repair my code, i tested with qwen and marco. That was 236 billions (270Gb Ram in q8), the new V3 of 671 billions is very huge, bigger than Meta. I'm waiting for quantized versions but even on my 12 Ram slots motherboard i could run maybe q3-q2 quality. To run full quality quantized like q6 or even q8 i think needed ~750+ Gb Ram (usually it's equal to model size + little more).

    • @santiagomartinez3417
      @santiagomartinez3417 15 ชั่วโมงที่ผ่านมา

      is it ram or vram?

    • @fontenbleau
      @fontenbleau 9 ชั่วโมงที่ผ่านมา

      @santiagomartinez3417 are you millionaire? Of course Ram DDR4. V2.5 coding you can run on 22 core CPU with okayish speed.

    • @santiagomartinez3417
      @santiagomartinez3417 2 ชั่วโมงที่ผ่านมา

      @@fontenbleau What tool do you use for that?

    • @fontenbleau
      @fontenbleau 2 ชั่วโมงที่ผ่านมา

      Any app based on LLama, you can't write them here.

  • @timothywcrane
    @timothywcrane 18 ชั่วโมงที่ผ่านมา

    I am still uneasy about adding CoT and other "steering" mechanisms directly into the token patterning of the model itself as these really seem to me not only alignment (Like DPO or heuristic analysis) but forcing a "core" mental model on the users of a mechanism that is SUPPOSEDLY looking to achieve generalization and wide utility. We're back to 1981. AI and Anthropic are Apple and MS, with Meta, MS and Google struggling not to become CompuServe and AOL... Prepare for some really shady shit... then with AI/Crypto... the browser wars were only an exercise. You ready Fahd? I am... and I am rooting for the little guy. These are all first-outers. They are all IBM, Commodore and Tandy... or even Solaris ;) They will never be the new Apple acting like the present one or the new MS with the same shady licensing and force fed, all batteries included schemes and strategy. With open weights and code however I can live with the "opt out" choice, but with closed models this will be even more of a conflict (at least for me).

    • @fahdmirza
      @fahdmirza  17 ชั่วโมงที่ผ่านมา +1

      Thanks for the insights, I agree with some points and it's a bit too early to predict the future.

  • @ComputerVisionFreelancer
    @ComputerVisionFreelancer 18 ชั่วโมงที่ผ่านมา

    😊😊

    • @fahdmirza
      @fahdmirza  17 ชั่วโมงที่ผ่านมา

      cheers