who is crazy enough to pay $500/month for LLM, knowing they will hallucinate once the tasks get moderately complex
@StopWork-aiКүн бұрын
In the future, we’re gonna be paying $3000 a month for an AI that can perfectly implement any feature instantly, the perfect coding AI today will be worth a lot
@EverRustingКүн бұрын
@@StopWork-ai LLMs are stagnating, even Claude can't do anything that remotely needs any logical thinking.
@wanjohiКүн бұрын
Companies
@davidrempel433Күн бұрын
KZbinrs making demos 😂
@mattb925Күн бұрын
@@StopWork-ai yeah bru, just a little detail: LLMs will never solve what you're saying. We need a completely different technology, this isn't AI at all, it's a probabilistic thingy that just tries to guess what you looking for based on what you typed. There's no reasoning, not even in what they call chain of thought because these aren't thoughts at all, are literally glorified searches in a huge set of training data
@aadlrКүн бұрын
Mega-useful and very articulate overview thanks Steve
Күн бұрын
Great overview. Thanks for trying it out for us. I for one would like to see Devin succeed independently and who knows, IDE agent and independent agent could be a good combo in the future. So that you don't have to be at the desk all the time.
@hqcart18 сағат бұрын
I used cursor for a whole day to create a chrome extension, it did the basic structure, but after my code exceeded 400 lines, it totally wasted a whole damn day trying to fix bugs here and there, finally i gave up, i started from scratch, and it took me 2 hours to do this by my self with some simple AI help.
@danielfitchomahaКүн бұрын
This was great! Love the comparisons between Devin and Cursor. Cursor fits very well into my workflow but there are still some quirks.
@cyberchenКүн бұрын
thanks for saving my five hundred bucks🎉
@magnetsec14 сағат бұрын
now give it to him
@オショニックКүн бұрын
I found your channel via a KZbin search. Thanks for sharing the review. I believe there is a lot of room for Devin to grow. What are your thoughts on whether this will impact freshmen and sophomores looking to get internships? Please share more details in the future
@waynej_xyzКүн бұрын
I think the start salary for the intern or junior developer is 500
@ScottimusPrime-j3qКүн бұрын
Thanks for the review. Devin programs like a confident idiot in this example, which is basically the scariest combination to have in a human developer. I've started treating my LLM codegen like PRs by keeping the scope of a single LLM change as small and focused as possible, validating as I go. It's so much easier instead of waiting for something like Devin to build an entire feature that you then have to then spelunk through. Both workflows still require a competent human, but reviewing PRs is always more challenging than having the iterative feedback loop of the development environment.
@erkinalp10 сағат бұрын
Devin has an IDE plugin too (accessible to personal and enterprise tiers, not usable in team tier due to how they set up team tier)
@MiguelCardoso-k1cКүн бұрын
At 2:50 devin used lovable to generate the web page instead of generating itself?
@erkinalp10 сағат бұрын
it sometimes tries to purposefully inflate compute usage on your and their side
@FrankWildOfficial8 сағат бұрын
Vs windsurf 10/m
@evanfutureКүн бұрын
Really helpful, thanks for sharing. I agree with your take, it's much more comfortable to use Cursor. But that's because I'm a competent coder already. I guess there will be a new breed of non-coders that will barely even look at the code Devin generates, and instead just focus on that being their dev team, same as it always was (eg, a product manager). Still a ways to go, of course.
@jambalaya974Күн бұрын
This is the biggest problem with AI agents. They degenerate and become stuck, so you either waste an insane amount of time conversing with them through this which has a very low chance of succeeding or you drop the problem altogether. It's alright for low hanging fruit like setting up a repo like it says on the instructions but the first roadblock that's nontrivial the foundational model faces, it just completely stops functioning.
@nickwoodward819Күн бұрын
just a warning: i bet half of you reading this are leaking your sensitive keys to cursor. mine had access to my .env and .cursorignore, even with the correct privacy settings enabled. turning off auto-context for chat seemed to work, but not sure about composer.
@kevinduigou321219 сағат бұрын
*Introduction to Devon AI Coding Agent* (00:00:00) *Devon's Features and Capabilities* (00:00:56) *Challenges with Devon* (00:02:10) *Comparative Analysis: Devon vs Cursor Agents* (00:05:05) *Opinions on AI Tool Development* (00:08:11)
@edanincКүн бұрын
And cursor vs windsurf?
@stranger_b1Күн бұрын
Tried windsurf, it just feels different, it’s too slow. Cursor I way better
@e404Күн бұрын
All that hype for an AI slack bot
@SahilP264815 сағат бұрын
I believe Windsurf is much much better than Cursor, but I haven't used the paid version of Cursor. I like how Windsurf changes the files and you don't have to 'Accept', you can just test your new code and commit. In cursor you still have to click a button to 'Accept all' or accept some, it's a slight nitpick but it matters a lot if you are doing repetitive tasks like copy pasting screenshots to recreate certain UI. I also feel like Windsurf has better context awareness as I can say stuff like '...recreate this UI and use the same state management logic as in file @File1...' and Windsurf just understands what things to pull from context. Cursor on the other hand is quite bad, and you need to constantly spoon feed it. Only slight problem with Windsurf is the 500 chat limit per month but you can still use Cascade base which is surprisingly good compared to Sonnet.
@bnjmn777917 сағат бұрын
Thanks for this honest non-overhyping review.
@goonie7913 сағат бұрын
Would like to see windsurf and cursor comparison. Thanks for the review
@flosset607022 сағат бұрын
I think devin target audience is managers, ceos and investors. Whereas, cursor target audience is developers
@erkinalp10 сағат бұрын
from my experience with devin, you'd still want someone with more-than-end-user programming knowledge to prompt devin
@poisondnaКүн бұрын
BOLD PRICE. If you're bold enough to charge $500 a month, your service better be perfect.
@1879heikkisorsa20 сағат бұрын
I don't think so. If it will become the quality of a junior dev, then it's worth a couple thousands per month, if not even more as you can run it 24/7.
@genericdeveloper396614 сағат бұрын
They aren't so much being bold as covering costs. The amount of LLM requests needed to sustain that is likely a ridiculous amount. But I don't even know if $500 would cover it in a week.
@erkinalp10 сағат бұрын
it's more like $1100/month if you work it full time, it's $50/month+credits or $500/month+credits
@coolmcdudeКүн бұрын
I can’t believe these companies are trying to charge us 200 or 500 a month for these new AI gimmicks
@drunktrump5209Күн бұрын
oh wow. such a shock. i never heard of this idea before. no one will ever charge $999 for a stupid mobile app. no one will ever try to charge $2000/month for something availabe for free. no one will ever charge a monthly fee to enable seat warming. shock all around....
@joedomatКүн бұрын
Capitalism. Is perfect. Dont like, dont pay,
@stranger_b1Күн бұрын
They convinced VCs to invest millions. They need to generate profits somehow.
@thebicycleman806221 сағат бұрын
u do understand that average basement bedroom Joe is NOT their target audience right? U DO understand that there are things called COMPANIES right?
@genericdeveloper396614 сағат бұрын
Try calling ChatGPT api over and over again recursively. The costs add up quickly. I doubt the price they set is even the final price they will need to charge.
@J3R3MI62 минут бұрын
I think we also half to consider GPT-5 (api) is right around corner
@s4ussКүн бұрын
the guy behind Devin is literally a math prodigy, and the best one at that, check his video of when he was young competing in a math show, insane speed, reading and calculation abilities. a bit weird why focuses on this, and not more fundamental problems in AI/Math.
@thebicycleman806221 сағат бұрын
coz this is wayyyy more marketable
@JustVibe44413 сағат бұрын
Cash grab.
@erkinalp10 сағат бұрын
because **computer programming is applied math**
@vh5x7Күн бұрын
Another subscriber here! Your video is excellent and very technical.
@jayhu607521 сағат бұрын
I agree with the user comments on Rumble; it's unreasonable to pay for this when they still haven't fixed the issues with reasoning and hallucination.
@mitchellrcohenКүн бұрын
Super interesting thanks!
@TreeLuvBurdpuКүн бұрын
Cline seems WAY better, and you get a price report for every single comment.
@guidedlabs42417 сағат бұрын
Well done. Instrumental analysis.
@MrRobot-gm9cvКүн бұрын
Great review! Devin will be considered Fraud for what they did to get that 2B val.
@ArcticwhirКүн бұрын
wow, not even a trial period of like a week ($50) or something, full $500 to see if this thing actually works. Thanks for the video though, really helpful and informative
@igorshingelevich7627Күн бұрын
Can I put 200$ openAi as a project manager + 500$ Devin team together?
@firesoul453Күн бұрын
One is intended to make YOU more productive and one is trying to be a Jr Dev. Its all very interesting. Almost hard to remember 2015 at this point.
@shaceabКүн бұрын
There's a logical concern about Devin AI's performance. If Devin is used in its own development process, and if its performance is not exceptional, this inherently questions the capability of the Devin AI that was used to develop itself. This creates a circular problem: how can we trust the development process of Devin if the tool used for its development (Devin itself) shows limitations in its performance?
@jungbtcКүн бұрын
can you compare with junior frontend developer too?
@caseystar_Күн бұрын
How did you get access?? This is from cognition?
@carloslfu14 сағат бұрын
Be honest, most PMs would love this shit.
@ronnybruknappКүн бұрын
Wonder what is best value, devin vs chatgpt pro. Next up to test?
@erkinalp10 сағат бұрын
both use openai infra
@BernardoKlopffleisch11 сағат бұрын
The way i see it Devin is not really intended for Tech team. More to a business manager or PM. And cursor is aimed directly for Devs...
@Qefx19 сағат бұрын
Can I pull the code? AI: hallucinates pull request lmao, so good!
@TheCreativeNickКүн бұрын
For $500/month you might as well build a super-computer and run your own local models
@hqcart18 сағат бұрын
did you just wakeup from a long cave sleep? a single H100 is about $30k dude
@TheCreativeNick7 сағат бұрын
@@hqcart1 It's a joke, chillout dude. Of course I'm aware how expensive those GPUs are
@rohovdmytroКүн бұрын
Nice review. Also, bring YouTuve chapters into the videos. Thanks.
@LePhenixGDКүн бұрын
Ain't Devin that one company that faked a demo about building an app or something ?
@ccerrato147Күн бұрын
It's too early for Devin. It's the timing for Cursor AI. Maybe 1 year from now Devin will be good enough but by then. Cursor can eat its lunch.
@youareawonderfulman6 сағат бұрын
Who in their right mind would be willing to pay $500 a month for an LLM, knowing that it’s likely to start hallucinating once the tasks get even somewhat complex? It just doesn’t seem worth the price when you consider how often these models struggle with anything more than basic requests. Sure, they’re powerful in their own right, but at that price point, shouldn’t we expect more consistent performance? I mean, is the convenience really worth the risk of errors and inaccuracies? I’d love to hear from anyone who’s actually using this-do you find it’s delivering value for the money, or is it just another overpriced tool?
@AJvanuwКүн бұрын
Thanks for ensuring me that my job will exist for a few more years at least XD
@treksis16 сағат бұрын
thanks. i was about to buy devin. saved me
@yutoriotsu884817 сағат бұрын
What’s funny here is that the price tag is not the value of the product (DEVIN), but the value they are trying to make investors believe. LLMs are stagnating, so they are desperately trying to paint 2025 as “the year of LLM agents.” Funny enough.
@cucciolo1825 сағат бұрын
Ouch, 500 bucks wasted. I got the personal access for 50 bucks, but I still can’t connect Devin to my VSC or Slack. Whenever I try to create something, Devin just gets stuck in loops and stops working. To make things worse, customer service was no help at all-they provided no information or assistance.
@my_name_is_ahadКүн бұрын
Alternatively, I can hire Rajesh for $200 for the entire month to serve as my coding assistant and manage my code.
@Rami_Elkady23 сағат бұрын
They have a premium version for 5000 usd per month - maybe you should try that.
@erkinalp10 сағат бұрын
Devin Enterprise?
@jamesxxxxxxКүн бұрын
you like cursor because you too smart
@JunYamogКүн бұрын
Thanks, your videos always have high signal to noise ratio.
@EsenEspinosa11 сағат бұрын
Until de AI coders work so good that they know what we need better than us.... control-oriented-ui beats trust-oriented-ui. We need trust + verify. :D
@MacS7nКүн бұрын
Who’s that crazy dev paying for a $500 slack bot
@jayhu607521 сағат бұрын
100% correct, they still haven't fixed the issues with reasoning and hallucination. It's better to train our own data on open-source LLMs rather than relying on big closed-source platforms like Slack.
@erkinalp10 сағат бұрын
@@jayhu6075 you can use devin without slack too, but setup without slack is intentionally kept a bit convoluted to attract mid size software houses rather than non-tech people and businesses
@cacogenicist17 сағат бұрын
$500/month is nuts
@erkinalp10 сағат бұрын
it's actually $500/month+credits or $50/month+credits in case of the personal tier that currently doesn't accept new signups
@edoardododoguzzi12 сағат бұрын
So 500 for something without ui and is on the same level of other llm wtf!?
@MaximillianHethКүн бұрын
$500 a month...😂😂😂😂
@im_chris_aiКүн бұрын
That sounds overpriced given most AI coders are between 10-20 USD/mo.
@erkinalp10 сағат бұрын
Devin performs like 40 percentile junior dev, that's the difference
@921EtherКүн бұрын
500 a month what a scam
@jayhu607521 сағат бұрын
100% right, they still haven't fixed the issues with reasoning and hallucination.
@TopCubyКүн бұрын
kevinmathscience💀
@Eriiiiiiiick22 сағат бұрын
COOL AGENT SIR
@tonywhite4476Күн бұрын
Cursor rules!!!
@MichealScott24Күн бұрын
❤
@Rami_Elkady23 сағат бұрын
You bashed Devin pretty good 6 months ago - directly saying they were lying. So, which one is it ? They good, bad ?
@SahilP264815 сағат бұрын
Slack based. Lmao.
@augmentos21 сағат бұрын
LMFAO a 'Slack based workflow' DEAD before they even start. Like MultiOn. Insanely funded teams that cant ship and find ridulous obstacles. Slack is trash. Good video Sub'd tnx
@erkinalp10 сағат бұрын
you can actually use it without slack, just that they keep that method purposefully convoluted to access
@randomdude2582Күн бұрын
i cant tell if your voice is AI or not.
@hamzameski-o3u19 сағат бұрын
Devin is the mot shitty scam out their 🚢
@schmetterling447716 сағат бұрын
In other words... it's a waste of time and money.
@wilhelmdebruyn8643Күн бұрын
lol 500 bucks a month no..... Your done
@CollabCrush20 сағат бұрын
You lost me at, "It's primarily a slack-based workflow." Sorry... but you couldn't pay me 500$ a month to use that.
@erkinalp10 сағат бұрын
you can actually use it without slack, they kept it purposefully convoluted while they scale up the infra (each devin VM gets an equivalent of a desktop PC's resources, hence it's understandable)
@christophhollmann20 сағат бұрын
this is an ad 👎
@BigBearBernie11 сағат бұрын
The way i see it Devin is not really intended for Tech team. More to a business manager or PM. And cursor is aimed directly for Devs....
@erkinalp10 сағат бұрын
i'm a devin subscriber and can tell it's clearly aimed at midsize software houses (40-500 devs)