Cos you can trick a human too with some of these, and it wouldn't warrant the conclusion that the tricked human can't reason. If it still can't do it when alerted that it's a trick question, then I'll probably agree about the seriousness of the issue. I've seen a couple of videos where you've done this type of thing and seemingly concluded that "they can't reason", and I feel like that conclusion is not warranted.
Please can you show us what it does when you warn it that it's a trick question etc., and whether it still gets stuck or not?
From my tests, even 4o is smarter than Gemini 2.0 Flash. OpenAI has fixed many simple mistakes that Google has not fixed yet.
No, actually I'm using Flash and it's far, far better than 4o. I use it for my science questions in physics and stuff, and it's really a lot better.
I've met many humans who can't pass the misaligned attention test.
wow
Orion dropping tomorrow. Wait until you get a load of that model.
As long as it never learns to think like you... humanity is safe.
It's capable, but not as smart as o1.
It's Flash, bro.
Yeah, it's like the equivalent of o1 mini, I suppose.
You can get even better results than o1 if you use an API and have it prompt itself back and forth.
yes!... and it's free.
@NakedSageAstrology it's experimental and Flash...
Not Pro or Ultra, or specifically a separate reasoning model at all...
o1 pro is the 🔝
Okay moneybagg
So did o1 solve this problem?
$200