DeepSeek R1 vs OpenAI o1 vs o3-mini: Who Is the Best Coding Model?

  • Published on 9 Feb 2025

Comments • 92

  • @YJxAI 1 day ago +1

    At 8:34, many have pointed out that I don't have the DeepThink (R1) button pressed, and without it the chat uses DeepSeek V3. Thanks a lot for the keen observation. I actually noticed this while testing and switched to DeepSeek R1, so the code that was generated came from DeepSeek R1. In editing I had to keep the zoomed-in part to show the prompt, so the clip shown is the one without the button pressed, but rest assured it's DeepSeek R1. :)

  • @ibreathpotates5558 4 days ago +38

    AI is also not safe from Indian lecturing

    • @YJxAI 4 days ago +9

      🤣

  • @RyluRocky 4 days ago +9

    The crazy part about this is that the AIs currently aren't allowed to see or visualize the result of their code, so they're essentially doing this blind.

    • @YJxAI 4 days ago +1

      Exactly

  • @thirugnanaskandhan9930 1 day ago +2

    Hey, you are not using DeepSeek R1. Just enable the DeepThink (R1) button below to see its full potential. Since you are not using DeepThink, the comparison is considerable, dude.

    • @YJxAI 1 day ago

      I have actually used it. In one of the instances I sent it without DeepThink, and that was highlighted. I did change it back to DeepSeek R1, but that did show up in the edit. Thanks for having a closer look at my video.

  • @torarinvik4920 4 days ago +9

    o3-mini has impressed me the most since Claude 3 Opus came out. I can't imagine how good these will be in just 6 months; they will probably be true beasts.

    • @YJxAI 4 days ago +2

      Yeah o3 pro 😍

    • @torarinvik4920 4 days ago +1

      @YJxAI But it's probably going to be too slow. Unless it's moderately fast, I'm not gonna use it.

    • @YJxAI 4 days ago

      I am okay with the speed, but the weekly limits, yuck!

    • @jktech2117 4 days ago +1

      Reminder that DeepSeek R1 was only a side project.

    • @nileshpatil3996 3 days ago +1

      In 6 months, one of the companies will achieve AGI.

  • @dennvandante 3 days ago +1

    That was a really nice comparison. Great video man. Definitely deserves more views. 💪🏻

    • @YJxAI 3 days ago

      thanks a lot

  • @Umangpy 4 days ago +10

    You should add Claude to these comparisons too. It's still a beast at coding.

    • @YJxAI 4 days ago +1

      Do you use Cursor or any other agentic IDE, and do you use Claude in it?

    • @monsterasap1827 4 days ago +2

      @YJxAI Yeah, Cursor also supports Claude.
      OK, one question: I used Cursor for 1 month, and after that it says unlimited access is over. So does it mean I now have some limitations? If so, what are they?

    • @YJxAI 4 days ago

      @monsterasap1827 Free trial, I guess.
      I did hesitate at first, but now
      I am on the Pro subscription and it seems unlimited to me. I code in it daily. It's worth it.

  • @Benette2 3 days ago +2

    Nice comparison! Love this type of video!

  • @Metrix224 2 days ago +1

    Hey everyone, I don't wanna watch it all, so please tell me which one.

  • @ComeTowardsIslam-mn9pb 2 days ago +1

    The Deepseek R1, OpenAI O1, and O3 Mini are all top contenders in the coding model space, each offering unique strengths and capabilities. Ultimately, the best coding model for a particular project or use case will depend on specific needs and requirements, making a thorough comparison and evaluation essential.

  • @Vishnuthealmighty 4 days ago +5

    How is DeepSeek R1 even responding? Whenever I try, it gives the error: 'The server is busy. Please try again later.' Do you have a solution for this?

    • @YJxAI 4 days ago +2

      Glad you asked. It works best around 1 AM IST.

    • @Family_Guy_12 4 days ago +1

      It's called the art of video editing 😂

    • @cariyaputta 4 days ago +4

      It works best around the time Americans go to sleep.

    • @naman.0316 4 days ago

      If you want to use DeepSeek R1, just use it on Hugging Face or Perplexity. They host DeepSeek R1 on their websites, and they don't have the Chinese censorship either.

    • @jeffwads 4 days ago

      He isn't doing it live, but when it works. Amazing, eh?

  • @Samuelkings 4 days ago +1

    Been waiting for this bro, thanks

    • @YJxAI 4 days ago +1

      🥹

  • @ComeTowardsIslam-mn9pb 2 days ago +1

    The battle for coding supremacy is on, but which model reigns supreme: DeepSeek R1, OpenAI o1, or o3-mini? Each has its unique strengths and capabilities.

  • @adams546 3 days ago +3

    Now this is what we call a coding test for SOTA LLMs, not some "how many Rs" question or creating a mock website.

    • @YJxAI 3 days ago

      🥹

  • @satishanu147 4 days ago +8

    Please compare o3-mini with Claude 3.5 Sonnet.

    • @YJxAI 4 days ago +2

      Do you use it in Cursor or any other IDE?

    • @mymoviemania1 4 days ago +3

      Yes, I do. Claude 3.5 Sonnet is better than R1 and o1. Don't know about o3.

    • @YJxAI 4 days ago +1

      Better than o3-mini, and better than any other model in Cursor by leaps and bounds.

  • @pechkurofff 1 day ago +1

    You forgot to choose DeepSeek R1 when prompting for the dragon.

    • @YJxAI 1 day ago +1

      I corrected it, don't worry, but that was trimmed in the video.

    • @pechkurofff 1 day ago +1

      @YJxAI OK, keep up the work. Your videos are some of the best for testing.

  • @ComeTowardsIslam-mn9pb 2 days ago +1

    The battle for coding supremacy is heating up with Deepseek R1, OpenAI O1, and O3 Mini, each boasting impressive capabilities, but only one can be crowned the best. Ultimately, the choice between these models depends on specific needs and preferences, as each excels in unique areas of coding and problem-solving.

    • @YJxAI 2 days ago

      yeah you are right

  • @videosclips_ 4 days ago +4

    I think that the new LLM models which are being launched come pre-optimized for simple questions like snake game, calculator, 9.11 vs 9.9 etc. 😅 But if you ask a different question, they do not give a proper answer😊

    • @YJxAI 4 days ago +3

      That's why I'm trying new things. I hope people like it.

  • @meowWeee 2 days ago +1

    If your comparison doesn't have Claude Sonnet, then your whole comparison took a wrong turn.

  • @sLavoncheg 4 days ago +2

    Just look at the API usage comparison by number of tokens. That's why there's no reason to add Claude here: it's still at the top,
    despite the fact that it has the oldest update.

    • @YJxAI 4 days ago

      o3-mini doesn't cost that much, as it seems, because Claude is a combined $18 vs. $5.5 for o3-mini, which seems high, but looking at my API usage I can confirm it's actually equal or cheaper.
      But even ignoring that, Claude has some points in its favour, which I'm thinking of discussing when I get time, but I have to test more to verify them.

  • @99DemonArts 3 days ago +1

    Use V3 for coding; R1 is for reasoning.

  • @puneet1977 3 days ago +1

    Interesting stuff. Good to see how they perform. But I think the best test would use the most common coding languages, since those are the ones most people will use.

  • @toji_reborn 4 days ago +2

    Now hear me out: what if you could combine every single AI that is used for coding into ONE? That'd be so OP bro, oml.

    • @YJxAI 4 days ago

      yeah

  • @mymoviemania1 4 days ago +2

    R1 is free whereas o3 and o1 are paid.

    • @YJxAI 4 days ago +2

      Yeah, a big, very big plus point, but be cautious with sensitive data on the DeepSeek website.

  • @paulojo720 4 days ago +3

    Not sure that's DeepSeek R1. The first dragon prompt isn't using DeepThink when you send the request.

    • @jeffwads 4 days ago +2

      Yeah, you are right. That does make a huge difference.

    • @YJxAI 4 days ago

      Yeah, actually I generated it and didn't see any thinking tokens. Then I corrected it and clicked the DeepThink button, but at that time I was not speaking, so that didn't show up.
      TL;DR: Don't worry guys, it's R1. But thanks for paying so much attention. ❤️

  • @Ginto_O 4 days ago +2

    That ball test was an absolute failure for all models.

    • @YJxAI 4 days ago

      o3 did maybe a bit better. They could have done better if I had used three.js, but many assets are already available there, so the result looks good without being too hard for the models. So I wanted to make them do things from scratch.

  • @iscifion7122 3 days ago +1

    Sonnet 3.5 is the best coding model.

  • @MandaniSikyana-r3g 4 days ago +1

    Do you have information about the new Claude?

    • @YJxAI 4 days ago

      The new 3.5? Yes, I have made 3 videos on it.

  • @nnnscorpionnn 4 days ago +1

    I wasn't expecting an Indian accent when I opened the video. However, thanks for the comparison.

  • @devinegamingtv3427 4 days ago +1

    To make a comparison I think everyone can understand: I think the current o3-mini-high is the equivalent of a GeForce GTX 250, and to get real-world performance that can do heavy work, we need to get it through several revolutions, closer to the RTX 3000 series and above.
    The GTX 250 could boot a modern AAA game, but at sub-HD resolution with everything on low, and it's not good to look at when you get 20-30 fps.

    • @YJxAI 4 days ago

      Yeah, good comparison. The only difference is that they released cards every 2 years with around 50% improvement or something like that, whereas AI labs saturate benchmarks in months. I think the exponential curve is steeper here.

    • @devinegamingtv3427 4 days ago +1

      @YJxAI Definitely a lot steeper. At least, I hope they can continue the steep curve :)

  • @alexandreivanov1417 2 days ago +1

    Bro talking about coding models but not about Claude 3.5 Sonnet. I'm done.

    • @YJxAI 2 days ago

      I've talked about Claude in more than three videos. I am done.

  • @Vishnuthealmighty 4 days ago +3

    AI is dumber than I thought.

    • @depression7807 4 days ago +2

      Not really. I think it's not trained on blander's dataset. Or they might forget it. Try a fine-tuned version of these models, or make one.

  • @bodethoms8014 4 days ago +1

    R1 did better when I tried it myself. Something is off about your r1

  • @gxguys 4 days ago

    Is it just me, or does o1 now generate its response a lot quicker than before?

    • @YJxAI 4 days ago

      It fluctuates. On release it spent very little time thinking, then in between it sometimes spent more time thinking, and now it's short again. Maybe OpenAI tweaks the time based on the general sentiment of people or on whether there is any competition, or it may just be a conspiracy.

  • @vetriselvan9807 1 hour ago

    Claude 3.5 Sonnet is the best coder from my perspective.

  • @volotem 4 days ago +1

    You used o3-mini-high, not o3-mini (((

  • @bosterdrone1379 3 days ago +1

    github copilot bro

  • @messengercreator 3 days ago +1

    Are these AI models the best AI in the whole world, or, as they say, the big companies' dumbest AI in the world?

  • @susanneschroder8409 4 days ago +2

    OpenAI o3-mini 🎉🎉🎉

  • @hmmmmmm_3429 4 days ago +1

    LLMs can't generate 3D objects like a dragon or a cup lol. Use a different kind of model for that test, such as 3D diffusion models.

  • @Family_Guy_12 4 days ago +1

    Bro, thanks for your efforts,
    but I don't like the new benchmark.
    The old one with a table and hard reasoning tasks is better.

  • @AllInfo-9 2 days ago

    The best coders are software engineers. Code from AI is a copy of code taken from software engineers. AI is hype and fraud.

  • @divugoyal415 4 days ago +1

    Have you heard of Flash Thinking? It's the daddy of all three of these.

    • @YJxAI 4 days ago

      watch my video on it :)

    • @divugoyal415 4 days ago

      @YJxAI If you made a video on it, why didn't you include it in this one? You are showing these models as the best.

  • @videosclips_ 4 days ago

    Bro, please compare with Gemini 1206 and similar models. They are also good.

    • @YJxAI 4 days ago +1

      Yes, it's a model I love and have covered as well. I have also covered the Flash and Thinking models; please check my channel.
      And 1206 is officially going to go Pro soon. So stay tuned for that....😁