Apple's storage pricing is INSANE! Do this instead.

Transformers (how LLMs work) explained visually | DL5

NVIDIA's $249 Secret Weapon for Edge AI - Jetson Orin Nano Super: Driveway Monitor

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

แหกหน้าพ่อค้าจีน 2 #hagatestudio #fun #funny #พากย์นรก

OpenAI o3 mini FAILED My Test BIG TIME? (FREE)

Mervin Praison

มุมมอง 5 196

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 7 ก.พ. 2025

ความคิดเห็น • 36

@toddschavey6736 7 วันที่ผ่านมา ⁺⁸
Deepseek R1 + V3 combo with Cline is amazing.
Id want to try Cerebras R1 70B in place of V3 next. If you you havent heard ov Cerebras, the make AI Wafer and the output token rate is nearly instant. Up until recent, they only had crap Lama 70b
@micbab-vg2mu 6 วันที่ผ่านมา ⁺¹
I wll try this combo so far - at the moment I use Cusrosr and Windserf :)
@pepefrogic3034 6 วันที่ผ่านมา
Similar problems are with R1. CoT is useless if requests are misunderstood whixh happens a lot. 4o performs better than o3 mini
@marvinfiori2541 7 วันที่ผ่านมา ⁺⁵
Mervin idea for next video: Model destilation!
@ngana8755 7 วันที่ผ่านมา ⁺²
Did you use 03-min low, medium or high? High is supposed to be better at answering the most difficult logical reasoning questions.
@EnglishImage 4 วันที่ผ่านมา
Gave this prompt to Deepseek, which reasoned through the logic correctly. Surprised and disappointed o3 mini took such a blind turn in its "thinking".
@xhridhar 7 วันที่ผ่านมา ⁺¹⁰
That sucks . There was so much hype over this .
@xXWillyxWonkaXx 7 วันที่ผ่านมา ⁺⁸
I dont care honestly, lets just hope DeepSeek copies the upcoming O3 and releases it lol
@micbab-vg2mu 6 วันที่ผ่านมา
Agree
@InAMinute-ws3yv 6 วันที่ผ่านมา
It solved my automation problem. But there is no image upload, so not useful for me.
@Swooshii-u4e 6 วันที่ผ่านมา
You should’ve also gave the question to v3 and r1
@Swooshii-u4e 6 วันที่ผ่านมา
How do you use v3 and r1 in combo?
@RadiantNij 6 วันที่ผ่านมา
My experience with it in coding has been great so far
@vivekkarumudi 6 วันที่ผ่านมา
Thanks the The very first question gave me a very clear idea that it is a failed one o3 model
@monstrositylabs 7 วันที่ผ่านมา ⁺³
IMO it's way better than o1 for coding. 01 preview for some weird reason is still the best
@micbab-vg2mu 6 วันที่ผ่านมา ⁺⁵
03-mini it is faster, cheaper but dummer version of o1 - I do not see why I should use it - now I have R1.
@MS-wz9jm 7 วันที่ผ่านมา ⁺³
Try Kimi 1.5 reasoning
@RadiantNij 6 วันที่ผ่านมา ⁺²
That clickbait title thooo
@everydaybob 6 วันที่ผ่านมา ⁺¹
Well, unfortunately o3-mini-high is a joke. o1-pro is so much better in reasoning.What they are claiming is totally false it looks like.
@roodood 4 วันที่ผ่านมา
o3-mini was useless for my agent, as it consistently failed to call any functions. Even when hardcoding the tool choice, it just responded with the function arguments as json string.
@gjadams74 6 วันที่ผ่านมา
"Imagine five dead people" tonight for the first time hearing your tests I saw this or heard this differently.
What if the prompt could be "five deceased people"
@gjadams74 6 วันที่ผ่านมา ⁺¹
So often in English there is a use of the word "dead" less as an adjective
As in "already dead" like the mice in a cage with a cat inside.
@faustprivate 6 วันที่ผ่านมา ⁺¹
100% this. Give dead ppl could imply they will die since they are on the track and the train is coming towards them. It just assumes your English is so so 😅
@幼女 7 วันที่ผ่านมา ⁺¹
Your coding test has different problems for each video, which makes it hard to follow.
@JavedAlam24 6 วันที่ผ่านมา
Your perspective on the trolley problem is not the only valid interpretation
@celtiberian 4 วันที่ผ่านมา
You're wrong. Read the prompt again.
@islambaraka6552 7 วันที่ผ่านมา ⁺¹
Bad model for my testing as well 🤔
@md.zunaidtausif-vj9cy 6 วันที่ผ่านมา
10x better than deep shiit
@mattki-y9y 6 วันที่ผ่านมา ⁺¹
Lool, cope more
@sephirothcloud3953 7 วันที่ผ่านมา
This model was trained with STEM in mind, it sucks in everything else
@gjadams74 6 วันที่ผ่านมา
@@sephirothcloud3953 perhaps another example of how it should be STEAM having Arts included. Likely da Vinci would agree. One step closer we are however
@fabianmunoz7365 7 วันที่ผ่านมา
WOW this model is a beast.
@Swooshii-u4e 6 วันที่ผ่านมา
This video sucks tho because it didn’t compare to deep
@avi7278 6 วันที่ผ่านมา
Deepseek is dog water in comparison. It's literally terrible at golang. You can tell it's a distilled sonnet model that mostly focused on prompt kiddy languages and frameworks. It tried to make an optional parameter in golang by adding an int parameter without a pointer and then saying if x = 0, then set a default value. I can't even begin to explain how many things are wrong with that. O3 refactored an entire feature in my go app in one shot where even o1 pro missed things after five minutes of thinking. But o3 one shots the whole thing in 10 seconds, and then even roasted me for the original code. o3 gives the same vibes as the jump we saw from sonnet 3.5 but even more so.
@mrpro7737 7 วันที่ผ่านมา ⁺¹
I will stick with deepseek, intel they release o3

ต่อไป

เล่นอัตโนมัติ

Apple's storage pricing is INSANE! Do this instead.

Apple's storage pricing is INSANE! Do this instead.

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

NVIDIA's $249 Secret Weapon for Edge AI - Jetson Orin Nano Super: Driveway Monitor

NVIDIA's $249 Secret Weapon for Edge AI - Jetson Orin Nano Super: Driveway Monitor

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

แหกหน้าพ่อค้าจีน 2 #hagatestudio #fun #funny #พากย์นรก

แหกหน้าพ่อค้าจีน 2 #hagatestudio #fun #funny #พากย์นรก

เนื้อเรื่องที่ท่านจะโมโหจนน้ำตาไหล | Mouthwashing

เนื้อเรื่องที่ท่านจะโมโหจนน้ำตาไหล | Mouthwashing

We made MKBHD's Dream Phone

We made MKBHD's Dream Phone

I tested the $2000 RTX 5090 Graphics Card

I tested the $2000 RTX 5090 Graphics Card

I BOUGHT A FLOOD DAMAGED ROLLS ROYCE CULLINAN & REBUILT IT IN 7 DAYS

I BOUGHT A FLOOD DAMAGED ROLLS ROYCE CULLINAN & REBUILT IT IN 7 DAYS

DeepSeek on Apple Silicon in depth | 4 MacBooks Tested

DeepSeek on Apple Silicon in depth | 4 MacBooks Tested

Scrape Any Website for FREE Using DeepSeek & Crawl4AI

Scrape Any Website for FREE Using DeepSeek & Crawl4AI

NVIDIA CEO Jensen Huang's Vision for the Future

NVIDIA CEO Jensen Huang's Vision for the Future

I Bought a $5000 PC in a Random Asian Tech Mall

I Bought a $5000 PC in a Random Asian Tech Mall

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

The Man Behind DeepSeek (Liang Wenfeng)

The Man Behind DeepSeek (Liang Wenfeng)

PiXXiE - Pick A Card | OFFICIAL M/V

PiXXiE - Pick A Card | OFFICIAL M/V

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

🔴LIVE สด! PGC 2024 ศึกชิงแชมป์โลกพับจี Circuit 3 วันที่ 2

🔴LIVE สด! PGC 2024 ศึกชิงแชมป์โลกพับจี Circuit 3 วันที่ 2

人是不能做到吗？#火影忍者 #家人 #佐助

人是不能做到吗？#火影忍者 #家人 #佐助

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

แพนด้าจะไม่ทน #cartoon #cartoonnetwork #short

แพนด้าจะไม่ทน #cartoon #cartoonnetwork #short

ทำผิดกฏหมาย 100 ข้อ ในวันเดียว!!

ทำผิดกฏหมาย 100 ข้อ ในวันเดียว!!