In the future, we’re gonna be paying $3000 a month for an AI that can perfectly implement any feature instantly, the perfect coding AI today will be worth a lot
@@StopWork-ai yeah bru, just a little detail: LLMs will never solve what you're saying. We need a completely different technology, this isn't AI at all, it's a probabilistic thingy that just tries to guess what you looking for based on what you typed. There's no reasoning, not even in what they call chain of thought because these aren't thoughts at all, are literally glorified searches in a huge set of training data
Mega-useful and very articulate overview thanks Steve
วันที่ผ่านมา +8
Great overview. Thanks for trying it out for us. I for one would like to see Devin succeed independently and who knows, IDE agent and independent agent could be a good combo in the future. So that you don't have to be at the desk all the time.
I used cursor for a whole day to create a chrome extension, it did the basic structure, but after my code exceeded 400 lines, it totally wasted a whole damn day trying to fix bugs here and there, finally i gave up, i started from scratch, and it took me 2 hours to do this by my self with some simple AI help.
I found your channel via a TH-cam search. Thanks for sharing the review. I believe there is a lot of room for Devin to grow. What are your thoughts on whether this will impact freshmen and sophomores looking to get internships? Please share more details in the future
Thanks for the review. Devin programs like a confident idiot in this example, which is basically the scariest combination to have in a human developer. I've started treating my LLM codegen like PRs by keeping the scope of a single LLM change as small and focused as possible, validating as I go. It's so much easier instead of waiting for something like Devin to build an entire feature that you then have to then spelunk through. Both workflows still require a competent human, but reviewing PRs is always more challenging than having the iterative feedback loop of the development environment.
This is the biggest problem with AI agents. They degenerate and become stuck, so you either waste an insane amount of time conversing with them through this which has a very low chance of succeeding or you drop the problem altogether. It's alright for low hanging fruit like setting up a repo like it says on the instructions but the first roadblock that's nontrivial the foundational model faces, it just completely stops functioning.
Really helpful, thanks for sharing. I agree with your take, it's much more comfortable to use Cursor. But that's because I'm a competent coder already. I guess there will be a new breed of non-coders that will barely even look at the code Devin generates, and instead just focus on that being their dev team, same as it always was (eg, a product manager). Still a ways to go, of course.
just a warning: i bet half of you reading this are leaking your sensitive keys to cursor. mine had access to my .env and .cursorignore, even with the correct privacy settings enabled. turning off auto-context for chat seemed to work, but not sure about composer.
*Introduction to Devon AI Coding Agent* (00:00:00) *Devon's Features and Capabilities* (00:00:56) *Challenges with Devon* (00:02:10) *Comparative Analysis: Devon vs Cursor Agents* (00:05:05) *Opinions on AI Tool Development* (00:08:11)
I believe Windsurf is much much better than Cursor, but I haven't used the paid version of Cursor. I like how Windsurf changes the files and you don't have to 'Accept', you can just test your new code and commit. In cursor you still have to click a button to 'Accept all' or accept some, it's a slight nitpick but it matters a lot if you are doing repetitive tasks like copy pasting screenshots to recreate certain UI. I also feel like Windsurf has better context awareness as I can say stuff like '...recreate this UI and use the same state management logic as in file @File1...' and Windsurf just understands what things to pull from context. Cursor on the other hand is quite bad, and you need to constantly spoon feed it. Only slight problem with Windsurf is the 500 chat limit per month but you can still use Cascade base which is surprisingly good compared to Sonnet.
I don't think so. If it will become the quality of a junior dev, then it's worth a couple thousands per month, if not even more as you can run it 24/7.
They aren't so much being bold as covering costs. The amount of LLM requests needed to sustain that is likely a ridiculous amount. But I don't even know if $500 would cover it in a week.
oh wow. such a shock. i never heard of this idea before. no one will ever charge $999 for a stupid mobile app. no one will ever try to charge $2000/month for something availabe for free. no one will ever charge a monthly fee to enable seat warming. shock all around....
Try calling ChatGPT api over and over again recursively. The costs add up quickly. I doubt the price they set is even the final price they will need to charge.
the guy behind Devin is literally a math prodigy, and the best one at that, check his video of when he was young competing in a math show, insane speed, reading and calculation abilities. a bit weird why focuses on this, and not more fundamental problems in AI/Math.
wow, not even a trial period of like a week ($50) or something, full $500 to see if this thing actually works. Thanks for the video though, really helpful and informative
There's a logical concern about Devin AI's performance. If Devin is used in its own development process, and if its performance is not exceptional, this inherently questions the capability of the Devin AI that was used to develop itself. This creates a circular problem: how can we trust the development process of Devin if the tool used for its development (Devin itself) shows limitations in its performance?
Who in their right mind would be willing to pay $500 a month for an LLM, knowing that it’s likely to start hallucinating once the tasks get even somewhat complex? It just doesn’t seem worth the price when you consider how often these models struggle with anything more than basic requests. Sure, they’re powerful in their own right, but at that price point, shouldn’t we expect more consistent performance? I mean, is the convenience really worth the risk of errors and inaccuracies? I’d love to hear from anyone who’s actually using this-do you find it’s delivering value for the money, or is it just another overpriced tool?
What’s funny here is that the price tag is not the value of the product (DEVIN), but the value they are trying to make investors believe. LLMs are stagnating, so they are desperately trying to paint 2025 as “the year of LLM agents.” Funny enough.
Ouch, 500 bucks wasted. I got the personal access for 50 bucks, but I still can’t connect Devin to my VSC or Slack. Whenever I try to create something, Devin just gets stuck in loops and stops working. To make things worse, customer service was no help at all-they provided no information or assistance.
100% correct, they still haven't fixed the issues with reasoning and hallucination. It's better to train our own data on open-source LLMs rather than relying on big closed-source platforms like Slack.
@@jayhu6075 you can use devin without slack too, but setup without slack is intentionally kept a bit convoluted to attract mid size software houses rather than non-tech people and businesses
LMFAO a 'Slack based workflow' DEAD before they even start. Like MultiOn. Insanely funded teams that cant ship and find ridulous obstacles. Slack is trash. Good video Sub'd tnx
you can actually use it without slack, they kept it purposefully convoluted while they scale up the infra (each devin VM gets an equivalent of a desktop PC's resources, hence it's understandable)
who is crazy enough to pay $500/month for LLM, knowing they will hallucinate once the tasks get moderately complex
In the future, we’re gonna be paying $3000 a month for an AI that can perfectly implement any feature instantly, the perfect coding AI today will be worth a lot
@@StopWork-ai LLMs are stagnating, even Claude can't do anything that remotely needs any logical thinking.
Companies
TH-camrs making demos 😂
@@StopWork-ai yeah bru, just a little detail: LLMs will never solve what you're saying. We need a completely different technology, this isn't AI at all, it's a probabilistic thingy that just tries to guess what you looking for based on what you typed. There's no reasoning, not even in what they call chain of thought because these aren't thoughts at all, are literally glorified searches in a huge set of training data
Mega-useful and very articulate overview thanks Steve
Great overview. Thanks for trying it out for us. I for one would like to see Devin succeed independently and who knows, IDE agent and independent agent could be a good combo in the future. So that you don't have to be at the desk all the time.
I used cursor for a whole day to create a chrome extension, it did the basic structure, but after my code exceeded 400 lines, it totally wasted a whole damn day trying to fix bugs here and there, finally i gave up, i started from scratch, and it took me 2 hours to do this by my self with some simple AI help.
This was great! Love the comparisons between Devin and Cursor. Cursor fits very well into my workflow but there are still some quirks.
thanks for saving my five hundred bucks🎉
now give it to him
I found your channel via a TH-cam search. Thanks for sharing the review. I believe there is a lot of room for Devin to grow. What are your thoughts on whether this will impact freshmen and sophomores looking to get internships? Please share more details in the future
I think the start salary for the intern or junior developer is 500
Thanks for the review. Devin programs like a confident idiot in this example, which is basically the scariest combination to have in a human developer. I've started treating my LLM codegen like PRs by keeping the scope of a single LLM change as small and focused as possible, validating as I go. It's so much easier instead of waiting for something like Devin to build an entire feature that you then have to then spelunk through. Both workflows still require a competent human, but reviewing PRs is always more challenging than having the iterative feedback loop of the development environment.
Devin has an IDE plugin too (accessible to personal and enterprise tiers, not usable in team tier due to how they set up team tier)
Vs windsurf 10/m
At 2:50 devin used lovable to generate the web page instead of generating itself?
it sometimes tries to purposefully inflate compute usage on your and their side
This is the biggest problem with AI agents. They degenerate and become stuck, so you either waste an insane amount of time conversing with them through this which has a very low chance of succeeding or you drop the problem altogether. It's alright for low hanging fruit like setting up a repo like it says on the instructions but the first roadblock that's nontrivial the foundational model faces, it just completely stops functioning.
Really helpful, thanks for sharing. I agree with your take, it's much more comfortable to use Cursor. But that's because I'm a competent coder already. I guess there will be a new breed of non-coders that will barely even look at the code Devin generates, and instead just focus on that being their dev team, same as it always was (eg, a product manager). Still a ways to go, of course.
just a warning: i bet half of you reading this are leaking your sensitive keys to cursor. mine had access to my .env and .cursorignore, even with the correct privacy settings enabled. turning off auto-context for chat seemed to work, but not sure about composer.
*Introduction to Devon AI Coding Agent* (00:00:00)
*Devon's Features and Capabilities* (00:00:56)
*Challenges with Devon* (00:02:10)
*Comparative Analysis: Devon vs Cursor Agents* (00:05:05)
*Opinions on AI Tool Development* (00:08:11)
And cursor vs windsurf?
Tried windsurf, it just feels different, it’s too slow. Cursor I way better
All that hype for an AI slack bot
I believe Windsurf is much much better than Cursor, but I haven't used the paid version of Cursor. I like how Windsurf changes the files and you don't have to 'Accept', you can just test your new code and commit. In cursor you still have to click a button to 'Accept all' or accept some, it's a slight nitpick but it matters a lot if you are doing repetitive tasks like copy pasting screenshots to recreate certain UI. I also feel like Windsurf has better context awareness as I can say stuff like '...recreate this UI and use the same state management logic as in file @File1...' and Windsurf just understands what things to pull from context. Cursor on the other hand is quite bad, and you need to constantly spoon feed it. Only slight problem with Windsurf is the 500 chat limit per month but you can still use Cascade base which is surprisingly good compared to Sonnet.
Would like to see windsurf and cursor comparison. Thanks for the review
Thanks for this honest non-overhyping review.
I think devin target audience is managers, ceos and investors. Whereas, cursor target audience is developers
from my experience with devin, you'd still want someone with more-than-end-user programming knowledge to prompt devin
BOLD PRICE.
If you're bold enough to charge $500 a month, your service better be perfect.
I don't think so. If it will become the quality of a junior dev, then it's worth a couple thousands per month, if not even more as you can run it 24/7.
They aren't so much being bold as covering costs. The amount of LLM requests needed to sustain that is likely a ridiculous amount. But I don't even know if $500 would cover it in a week.
it's more like $1100/month if you work it full time, it's $50/month+credits or $500/month+credits
I can’t believe these companies are trying to charge us 200 or 500 a month for these new AI gimmicks
oh wow. such a shock. i never heard of this idea before. no one will ever charge $999 for a stupid mobile app. no one will ever try to charge $2000/month for something availabe for free. no one will ever charge a monthly fee to enable seat warming. shock all around....
Capitalism. Is perfect. Dont like, dont pay,
They convinced VCs to invest millions. They need to generate profits somehow.
u do understand that average basement bedroom Joe is NOT their target audience right? U DO understand that there are things called COMPANIES right?
Try calling ChatGPT api over and over again recursively. The costs add up quickly. I doubt the price they set is even the final price they will need to charge.
the guy behind Devin is literally a math prodigy, and the best one at that, check his video of when he was young competing in a math show, insane speed, reading and calculation abilities. a bit weird why focuses on this, and not more fundamental problems in AI/Math.
coz this is wayyyy more marketable
Cash grab.
because **computer programming is applied math**
I agree with the user comments on Rumble; it's unreasonable to pay for this when they still haven't fixed the issues with reasoning and hallucination.
Another subscriber here! Your video is excellent and very technical.
Cline seems WAY better, and you get a price report for every single comment.
Super interesting thanks!
Well done. Instrumental analysis.
Be honest, most PMs would love this shit.
Great review! Devin will be considered Fraud for what they did to get that 2B val.
wow, not even a trial period of like a week ($50) or something, full $500 to see if this thing actually works. Thanks for the video though, really helpful and informative
Can I put 200$ openAi as a project manager + 500$ Devin team together?
One is intended to make YOU more productive and one is trying to be a Jr Dev. Its all very interesting. Almost hard to remember 2015 at this point.
can you compare with junior frontend developer too?
There's a logical concern about Devin AI's performance. If Devin is used in its own development process, and if its performance is not exceptional, this inherently questions the capability of the Devin AI that was used to develop itself. This creates a circular problem: how can we trust the development process of Devin if the tool used for its development (Devin itself) shows limitations in its performance?
How did you get access?? This is from cognition?
The way i see it Devin is not really intended for Tech team. More to a business manager or PM. And cursor is aimed directly for Devs...
Wonder what is best value, devin vs chatgpt pro. Next up to test?
both use openai infra
Can I pull the code? AI: hallucinates pull request lmao, so good!
For $500/month you might as well build a super-computer and run your own local models
did you just wakeup from a long cave sleep? a single H100 is about $30k dude
@@hqcart1 It's a joke, chillout dude. Of course I'm aware how expensive those GPUs are
Nice review. Also, bring YouTuve chapters into the videos. Thanks.
Ain't Devin that one company that faked a demo about building an app or something ?
Who in their right mind would be willing to pay $500 a month for an LLM, knowing that it’s likely to start hallucinating once the tasks get even somewhat complex? It just doesn’t seem worth the price when you consider how often these models struggle with anything more than basic requests. Sure, they’re powerful in their own right, but at that price point, shouldn’t we expect more consistent performance? I mean, is the convenience really worth the risk of errors and inaccuracies? I’d love to hear from anyone who’s actually using this-do you find it’s delivering value for the money, or is it just another overpriced tool?
It's too early for Devin. It's the timing for Cursor AI. Maybe 1 year from now Devin will be good enough but by then.
Cursor can eat its lunch.
Thanks for ensuring me that my job will exist for a few more years at least XD
thanks. i was about to buy devin. saved me
What’s funny here is that the price tag is not the value of the product (DEVIN), but the value they are trying to make investors believe. LLMs are stagnating, so they are desperately trying to paint 2025 as “the year of LLM agents.” Funny enough.
Ouch, 500 bucks wasted. I got the personal access for 50 bucks, but I still can’t connect Devin to my VSC or Slack. Whenever I try to create something, Devin just gets stuck in loops and stops working. To make things worse, customer service was no help at all-they provided no information or assistance.
Alternatively, I can hire Rajesh for $200 for the entire month to serve as my coding assistant and manage my code.
They have a premium version for 5000 usd per month - maybe you should try that.
Devin Enterprise?
you like cursor because you too smart
Thanks, your videos always have high signal to noise ratio.
Until de AI coders work so good that they know what we need better than us.... control-oriented-ui beats trust-oriented-ui. We need trust + verify. :D
Who’s that crazy dev paying for a $500 slack bot
100% correct, they still haven't fixed the issues with reasoning and hallucination.
It's better to train our own data on open-source LLMs rather than relying on big closed-source platforms like Slack.
@@jayhu6075 you can use devin without slack too, but setup without slack is intentionally kept a bit convoluted to attract mid size software houses rather than non-tech people and businesses
$500/month is nuts
it's actually $500/month+credits or $50/month+credits in case of the personal tier that currently doesn't accept new signups
So 500 for something without ui and is on the same level of other llm wtf!?
$500 a month...😂😂😂😂
500 a month what a scam
100% right, they still haven't fixed the issues with reasoning and hallucination.
That sounds overpriced given most AI coders are between 10-20 USD/mo.
Devin performs like 40 percentile junior dev, that's the difference
kevinmathscience💀
COOL AGENT SIR
Cursor rules!!!
❤
You bashed Devin pretty good 6 months ago - directly saying they were lying.
So, which one is it ? They good, bad ?
Slack based. Lmao.
LMFAO a 'Slack based workflow' DEAD before they even start. Like MultiOn. Insanely funded teams that cant ship and find ridulous obstacles. Slack is trash. Good video Sub'd tnx
you can actually use it without slack, just that they keep that method purposefully convoluted to access
i cant tell if your voice is AI or not.
Devin is the mot shitty scam out their 🚢
In other words... it's a waste of time and money.
lol 500 bucks a month no..... Your done
You lost me at, "It's primarily a slack-based workflow." Sorry... but you couldn't pay me 500$ a month to use that.
you can actually use it without slack, they kept it purposefully convoluted while they scale up the infra (each devin VM gets an equivalent of a desktop PC's resources, hence it's understandable)
this is an ad 👎
The way i see it Devin is not really intended for Tech team. More to a business manager or PM. And cursor is aimed directly for Devs....
i'm a devin subscriber and can tell it's clearly aimed at midsize software houses (40-500 devs)