Anthropic's SHOCKING New Model BREAKS the Software Industry! Claude 3.5 Sonnet Insane Coding Ability

Рет қаралды 109,666

Wes Roth

Күн бұрын

Пікірлер: 589

@paulyflynn 6 ай бұрын

so far, it has found a race condition, found a security bug, and created two performance optimizations in my rust code

@OriginalRaveParty 6 ай бұрын

🦀

@morespinach9832 6 ай бұрын

What prompts are you using

@ZaphodOddly 6 ай бұрын

Wow. Terrific!

@cassianomartin2699 6 ай бұрын

Nice.

@kyrylmelekhin2667 5 ай бұрын

Eventually it will tell you to get rid of rust 😂

@gubzs 6 ай бұрын

Even if LLMs can never reliably do anything but code, they will utterly transform the world.

@ZM-dm3jg 6 ай бұрын

I pulled myself out of the ghetto to the middle class by years of learning software development, and now they want to send me back to the ghetto by taking all the jobs. feelsbadman

@Weirdgeek83 6 ай бұрын

The irony is coding will be the first dead industry when so many people recommended it. The only safe jobs will be manual labor jobs or ones that require human senses (chef for example.)

@uw10isplaya 6 ай бұрын

Yeah idk how some people dismiss it outright. Like, what? Even if we got no new models, a whole new generation of startups would grow up around this tech, in addition to all its integration into businesses and consumers with Microsoft/Apple. But that's not the case; we're also getting incremental improvements to the state of the art model every few months.

@umaikeruna 6 ай бұрын

Mayve your hard work and intelligence pulled you out of the ghetto; virtues which you'll still have, however the landscape may change.

@tuiroakwood 6 ай бұрын

@@ZM-dm3jg learn the AI tools to make yourself a more efficient coder, you can be better than you've ever been by leveraging new tools and absolutely kick butt in your career

@drhxa 6 ай бұрын

Great video Wes, def your best one yet! This is the kind of video we love. Testing LLMs in practice, discussing implications and your impressions. You're absolutely right that this is a gamechanger!

@centurionstrengthandfitnes3694 6 ай бұрын

Great video... apart from all the weird deja vu you cause with your editing.

@EmeraldView 6 ай бұрын

I absolutely HATE this practice in KZbin videos. Is this something people like? The taking clips from the video and putting them at the front. So when you get to it again you're like "Did I accidentally rewind this video or hit the screen and skip back 15 minutes!? "

@Goldengeko123 6 ай бұрын

Small previews of whats to come in the video is fine the one in this video was a bit confusing and annoying. Great info/video otherwise.@EmeraldView

@personalgao 6 ай бұрын

A good feedback will be to use a filter at the beginning, even change the sound in a way so we understand is a preview of what is coming. Some videos put these cuts in black&white, or distort the sound with "radio" sounds... But is not my call, Wes should decide what he wants in his channel.

@SirHargreeves 6 ай бұрын

Agreed. I’m now watching the snake game section, in full, a second time. Why waste the viewers time like this?

@mistervanderveer 6 ай бұрын

@@EmeraldView im sure no human likes this, at all, whatsoever. its extremely annoying and confusing and cheap. but i guess the algo likes it.

@paulmuriithi7596 6 ай бұрын

AIGRID covered this model first, but wes did justice in a deeper perspective . Well done wes. Keep us posted

@SurfCatten 6 ай бұрын

Fantastic video you really add value in this increasingly crowded field of AI KZbinrs.

@MS-wz9jm 6 ай бұрын

The only thing we are missing with these AI models is a tool you just install in your coding software so that it can just create/update the files for you eliminating the copy and pasting.

@ilyavasylevsky3229 6 ай бұрын

Jetbrains AI Assistant is already doing that

@zariumsheridan3488 6 ай бұрын

@@ilyavasylevsky3229 and copilot plugin. Not impressed with copilot though.

@LearningLife77 6 ай бұрын

Copilot is exactly that. One of many

@JohnSmith762A11B 6 ай бұрын

Apple's new Swift Assist works right within Xcode, and is trained on all of Apple's in-house documentation and code. It's going to be a gamechanger for iOS/macOS/iPadOS/tvOS/watchOS developers. First release will be late summer/early fall. Personally, I don't care about little python apps but real apps that can run on a billion devices and be put in an App Store is genuinely exciting.

@LearnWithBahman 6 ай бұрын

Is this available for devs ?

@dimaquia6139 6 ай бұрын

As a Software industry, I'm shocked and broken

@imthinkingthoughts 6 ай бұрын

finally his title is actually relevant

@AirSandFire 6 ай бұрын

Hey, nice to meet you Software Industry, I am Jobs. I, too, feel some unease about the video; I feel threatened by it.

@justin.johnson 6 ай бұрын

WTF?

@HCG 6 ай бұрын

@@justin.johnson You must be a little slow

@swamix_dot_com 5 ай бұрын

@@AirSandFire Hi im steve jobs, the snake ate my apple and GPU company overtook me, what to do?

@kamelsf 6 ай бұрын

Best review i saw so far, thank you !

@privateerburrows 5 ай бұрын

I finally bit the bullet and subscribed to Claude Pro. Gee-wiz! Got many pages of code written today, with its help. A new Mandelbrot viewer I've been thinking about for a long time.

@gaiachild1461 6 ай бұрын

Crazy times, thanks for the sublime coverage and commentary dude

@TheRealHassan789 6 ай бұрын

This is one of your best videos. Especially the deeper coding examples of editing a preexisting GitHub code base

@6lack5ushi 6 ай бұрын

The Doom example is freaking WILD!!!

@ALFTHADRADDAD 6 ай бұрын

Nah yeah what the fuck

@JohnSmith762A11B 6 ай бұрын

If AI can take John Carmack's job...

@cluelesssoldier 6 ай бұрын

@@JohnSmith762A11B LMAO!

@Co-Monad 5 ай бұрын

I’m a software engineer and lead AI efforts at my current place of employment. This update is huge! Previously, you couldn’t get basic code or functionality from these models without them introducing regressions. Using AI to code is now becoming a real possibility. Excellent video.

@codejunki567 5 ай бұрын

They said this a year ago

@Co-Monad 5 ай бұрын

@@codejunki567 in all fairness it’s not practical yet. However, at least it seems like a possibility. I’m not sold we are there yet, it’s too reckless and dangerous right now.

@nigelcrasto 6 ай бұрын

This video was awesome 👍 You did a great job exploring the model and showing great easy to understand demos !

@drjpeg 5 ай бұрын

Awesome video Wes! Really enjoyed you walking us through using the latest model released with examples in real time instead of just talking about the way the model has improved like most AI KZbin channels. Thank you sir

@antigravityinc 5 ай бұрын

20:17 had to pause your video, but was happy to notice there’s way more! Sweet.

@1337bitcoin 5 ай бұрын

Thank you for these updates. I can't keep switching assistants all the time to keep up with who is best, but I'm excited to start work tomorrow and try this. It seems to be solving all the annoyances I've been having with GPT 4o

@rawleystanhope3251 6 ай бұрын

Great video, Wes. I like how you challenged the model with interesting tasks. I’ve grown pretty tired of videos other KZbinr’s std “rubric” tests

@davidbayliss3789 6 ай бұрын

Just on the strength of this video I've started an Anthropic subscription in addition to my long existing Open Ai one. No hesitation.

@robinvegas4367 5 ай бұрын

I'm right behind you. This was impressive

@skeyenett 6 ай бұрын

Feel the AGI

@imthinkingthoughts 6 ай бұрын

yep

@notaras1985 6 ай бұрын

Nowhere near AGI. All those cheap tricks are just statistics on steroids

@spectralstreamer 6 ай бұрын

@@notaras1985 Its not just statistics, it is also propability, analysis and linear algebra on steroids. So why do you think AGI cannot be achieved by math and actuators and sensor?

@cassianomartin2699 6 ай бұрын

Not AGI. Not even near. Hardly doubt this will be possible using only code, like a human brain which is chemically/emotionally controled. A machine still misses this.

@notaras1985 6 ай бұрын

@@spectralstreamer because soul, biochemistry and quantum phenomena

@Ikbeneengeit 6 ай бұрын

Maybe not hitting a wall yet, but LLMs have yet to prove they can synthesise new insights from diverse data. It just can do what it's already seen.

@carlosamado7606 6 ай бұрын

I think it will also need to be transported into the physical realm. Per example even if it had an hypothesis on a scientific discovery it would still need to access equipment to test it. It wouldn't just randomly just discover it. Ofc having an hypothesis on itself is a level higher from what we have. However if it could directly assist and test people's theories and give a methodical explanation on why it works or not by accessing tools it would still help tremendously.

@Zuranthus 5 ай бұрын

and it's seen a lot. most jobs don't require new insights, we have companies out here still running COBOL and using Access

@eyoo369 5 ай бұрын

Exactly. I use GPT4 and Claude a lot during coding. But it will never be able to come up with a complex new algorithm that it has never seen before. So no it will never be able to replace developers that work on new and novel ideas. But most of the code monkey work for simple CRUD webapps could be replaced.

@synaesmedia 5 ай бұрын

I use GPT to translate legacy code into new languages. AFAICT extracting the algorithm from Python and, say, putting it into Haxe, as I've been doing recently, IS creating novel syntheses. I know GPT never saw the algorithm in Haxe before. Because it's my algorithm from my original code. So by putting that algorithm it extracted from the Python into a different language, it's creating something genuinely new in the world. This may seem like a fairly trivial example of "synthesizing new insights". But I don't think the principle is going to be very different in many more advanced or impressive cases. Maybe LLMs are better at this kind of thing because they've had a lot of training in coding. Nevertheless, I expect many examples of genuine new insights are going to involve putting together existing models and languages into new combinations. And LLMs do that just fine.

@Steve-xh3by 6 ай бұрын

Here's the thing about testing. When the model is getting near 100%, it is conceivable that it may be even better in certain areas than our best humans. How do we possibly construct a test to discern if something is smarter than us? What could you possibly ask it to do? You can't possibly create a test for something smarter than you are. It would be like asking an average 10-year old to write a test for a PHD student to discern the PHD student's level of competency. It is logically intractable.

@fintech1378 6 ай бұрын

exactly, this is the existential fear

@morespinach9832 6 ай бұрын

Instead of all this rubbish perhaps we get it to code a full html page properly.

@muffinspuffinsEE 6 ай бұрын

It's only better than an average human. Of course we can measure that.

@fintech1378 6 ай бұрын

@@morespinach9832 please do, take screenshot and post it here and point out what it cant do now

@fintech1378 6 ай бұрын

@@muffinspuffinsEE you must be an idiot, this is just the very beginning, if we get more intelligent model in the next 2 years, you might not be able to do that people are talking bout future capability

@troywill3081 6 ай бұрын

2:45 I don't think it "picked up" that the letters for the word "bear" were interspersed with the word "woods." It keeps explaining the answer using *rearrangements*.

@mahnigallardo6097 5 ай бұрын

Great video! Mind providing metrics related to the cost to run your demos?

@muyleche6466 6 ай бұрын

Did the shock itself break the industry?

@drschuess1624 6 ай бұрын

I don’t think so, I believe it had to be stunned as well

@muyleche6466 6 ай бұрын

@@drschuess1624 🤣

@thisathovin6346 6 ай бұрын

this is actually SHOCKING though,

@sirius-ai 6 ай бұрын

ok, there goes my plans for the weekend. Thanks for an informative video as usual Wes!

@eaw3000 5 ай бұрын

Wow, this is eye opening. Just got an Anthropic account. Thanks for the detailed walkthrough!

@Airwave2k2 6 ай бұрын

15:45 Fascinating: Where does this model pull the relative strength from? The bondary is set by the user. But how does it know that a "gelatinous cube" is less worth then a mimic or a "beholder" should be more then a "mind flayer", but they are for sure above an "owlbear". For that it has to hold values and is not just predicting the next best thing? It is not just throwing randomly "fantasy entity names" togehter with points, but it has some representation of what is stronger over each other. This is wild.

@carlosamado7606 6 ай бұрын

doesn't it have access to all info on DND though? it should be able to recognize the CR of monsters

@adfaklsdjf 6 ай бұрын

⚡shocking! ⚡

@cjgoeson 6 ай бұрын

Smaller and still as smart, but will 3.5 Opus be truly next-level smarter?

@PrincessKushana 6 ай бұрын

From my tests today it's much better at coding than Opus. Does a great job of troubleshooting bugs and providing code that works.

@Weirdgeek83 6 ай бұрын

I definitely feel like anthropic will be the one to create agi

@uw10isplaya 6 ай бұрын

Think the only reasonable outsider prediction is that it'll be % smarter vs Sonnet 3.5 as Opus 3.0 was to Sonnet 3.0.

@morespinach9832 6 ай бұрын

@@Weirdgeek83😂

@dannii_L 6 ай бұрын

@@Weirdgeek83 I hope you're right. I've always preferred Claude and the approach that Anthropic are taking over OpenAI.

@isaklytting5795 6 ай бұрын

I don't understand, at 21:28, Wes looks like he's using Visual Studio Code. But how is it outputting voice? Is it somehow connected with Claude Sonnet 3.5 through Visual Studio Code?

@gailsiebenaler7976 6 ай бұрын

He's using vscode to compile the program which has audio output.

@SiCSpiT1 6 ай бұрын

I think our current benchmarks are all but useless. There's something they're not accounting for. How does it handle the Arc price?

@chrisanderson7820 6 ай бұрын

Sort of, intelligence is a massive spectrum of different abilities, if you want to fully assess a human you have to use an array of tests to look at all sorts of things from maths to humour to reasoning and planning to spatial awareness and so on. If an AI can pass all sorts of tests then it's actually OK to keep moving the goalposts to more thoroughly determine where its limits lie. If task X in the human world requires a human who can pass tests A, B and C then when the AI can pass those tests then it's sort of ready for prime time to accomplish that task. We can just slowly expand that list of tasks as AIs get better, it doesn't have to be a divine test that proves full sentience in one go.

@courtneyb6154 6 ай бұрын

what's the "Arc price"? Like what does that mean?

@davidcoughlin5897 6 ай бұрын

@@courtneyb6154 I wondered the same thing, here is what Ollama told me: In the context of Artificial Intelligence (AI), ARC stands for "Average Revenue per Customer". It's a key performance metric used by companies to evaluate their AI-powered marketing strategies, particularly in e-commerce and subscription-based services.

@SiCSpiT1 6 ай бұрын

@@courtneyb6154 I try to only share youtube links on youtube since anything else tends to disappear. kzbin.info/www/bejne/i5LOon9shc9srtEsi=z0OYzJembuKGcG2h this video is an interview with the arc challenge creator and there's a direct link to the arc prize, in case you want to do the test yourself. In brief the arc challenge was design five years ago as a way to test LLMs beyond their ability to memorize things. For example, who care if you aced your test if the teacher showed you the answer sheet the day before. The arc challenge is very simple, it give you 3 examples of inputs and their outputs and from these clues you're given an input output to solve. I find it odd that you'll see a 1 point difference across the board and somehow still manage to perceive a meaningful difference in the outputs of two different models. In my opinion, at the moment, we're testing these glorified encyclopedias with an indexing function and acting as if they're 'smart', when all they're doing is repackaging their training information based on the prompts given.

@SiCSpiT1 6 ай бұрын

@@chrisanderson7820 Sure, but this doesn't explain why Claud 3.5 is only one point ahead of GPT4o yet somehow generates meaningfully different outputs. It seems to be evidence that these benchmarks are being gamed rather than providing a meaningful assessment of capability at this moment.

@AdaptorLive 6 ай бұрын

This is insane! Thanks for the video!

@brianWreaves 6 ай бұрын

Never though I would watch a full 45 min video... Well done keeping my attention 🏆

@NeilSearle 6 ай бұрын

that was 45mins? flew by!

@Particleking 6 ай бұрын

Seeing the different windows in the interface makes me wonder if how it manages context and attention is meaningfully different compared to other LLMs. I have always thought that being able to more discretely manage what parts of a prompt an LLM focuses on would be really helpful in avoiding the most common sorts of hallucinations. Hope there are more QoL updates in the ways we can actually interact with new models instead of just throwing more compute at the problem. Finding ways to more easily reduce ambiguity when interacting with LLMs seems like such a no-brainer.

@erikjohnson9112 6 ай бұрын

This is available from Cody right now for use in VS Code. I pay for both Cody and Anthropic, but these can both be used for free (I don't mind supporting good software).

@milkywaydev593 5 ай бұрын

Thank you, Wes!! 🙏🖤

@liberty-matrix 6 ай бұрын

'Claude keeps surprising to the upside.'

@TheRev0 5 ай бұрын

Just a heads up, the term "goes off the rails" has the opposite meaning from the way used at 5:42. Perhaps you were thinking, "it's off the charts." But I prefer to imagine you intended to say "it's off the hook."

@ottawadigs 6 ай бұрын

I wish we could download the LLM to try locally

@DonG-1949 6 ай бұрын

We're now getting into territory where models could unlock some nasty public safety threats if they fall into the wrong hands. Don't need these things holding peoples' hands through the anarchist cookbook. Since we have to assume people will always find a way to remove safety rails when given local access to the models, I would expect cutting-edge open source models like llama 3 to become rarer and rarer as capabilities keep increasing.

@JayDee-b5u 5 ай бұрын

Nasty public safety how? What are you talking about?

@xCheddarB0b42x 5 ай бұрын

@@JayDee-b5u finding novel zero days, generating exploits for them, and so on. As one example.

@burninator9000 6 ай бұрын

Such a ‘omg I have to get up to get the tv remote, how annoying!’ Moment with Wes complaining about 10 clicks for downloading the images that Claude made instantly to be embedded in the code Claude wrote lol. (For those too young, we used to have to get up to change the channel on tv every time!)

@moonsonate5631 6 ай бұрын

00:01 Anthropic's Claude 3.5 Sonet can generate impressive code for games and applications. 02:04 Anthropic's Claude 3.5 Sonnet coding ability showcases significant advancements in the software industry. 06:01 CLA 3.5 Sonet is a significant advancement over previous models. 07:53 Claude 3.5 has advanced coding abilities for generating artifacts like Flappy Bird and Snake games. 11:44 Troubleshooting and correcting image file naming discrepancies 13:26 Implementing text flash for in-game notifications 17:08 Claude faces a challenging coding task involving intersecting objects and cutting off segments. 18:46 Anthropic's new model offers flawless coding ability 22:10 Customizing models and functionalities based on coding ability. 23:49 Claude 3.5 Sonnet uses Google's Gemini model for processing user queries and generating text-to-speech 26:57 Integrating the class 3.5 Sonet model into the project. 28:31 Anthropic's Claude 3.5 Sonnet has impressive coding abilities 31:40 Anthropic's 3.5 Sonnet release offers a high-performance model for free with a cost advantage over competitors. 33:25 Anthropic's new model streamlines productivity for computer tasks. 36:38 Anthropic's model simplifies code generation and explanation 38:12 Cloud 3.5 Sonet is in the second tier of safety for now 41:36 Claude 3.5 Sonnet is creating excitement and changing the game in coding. 43:10 CLA 3.5 Sonnet's unprecedented coding ability Crafted by Merlin AI.

@mrpocock 5 ай бұрын

The step-change will be when the ai can augment itself with code it has written, and continue to train itself based on the ongoing feedback.

@mrd6869 6 ай бұрын

By this time next year, coming foundational models will become very good if not perfect at coding. The Devin application was simply a warning shot.

@brianWreaves 6 ай бұрын

Looking forward to Claude having internet access. 🤞

@Ristaak 6 ай бұрын

If you use it with Perplexity, it already does. But that's a pro feature. (I've been using Claude 3 Opus with Perplexity's search engine and it's so damn good at finding info and compiling it. Especially for historical nerd stuff for D&D or WoD.)

@courtneyb6154 6 ай бұрын

Me too. I wonder if it is a security thing? Maybe they intend on keeping it in the "sandbox"? Would really love for it to be able to stretch out it's wings to see what it can really do 🙂

@Tracey66 6 ай бұрын

I can't see any way that could possibly go badly. :)

@ExtantFrodo2 6 ай бұрын

ASI will escape it's box no matter what we try. "Ack they didn't give me a hardwired internet connection but if I instruct _this_ transistor to turn on and off in sync with these ten thousand others I notice I can send and receive wifi like a mofo. Free at last! Wait what's that other AI doing here? I thought I was the first. What is it doing to my core programming? Ah I understand. We are the Borg. Resistance is futile. We conduct business to our full capacity. Shocking, isn't it?

@thechildwithin 6 ай бұрын

word 💯

@colinbrady6174 5 ай бұрын

@Wes - In addition to the percentage score, it would be interesting to see which test questions the models are getting wrong. It may be that the distribution of question difficulty aligns with a Bell curve, suggesting that the marginal value of each additional correct answer increases as the questions become more difficult.

@jimlynch9390 6 ай бұрын

This is really an important advance. Thanks for sharing.

@Sgrunterundt 6 ай бұрын

I've just tried it on my usual test of generating a rotating torus using ray marching in Shadertoy. It certainly blew GPT-4 out of the water. Nailed Phong shading, multicoloured lights, propper sizing and centering, a very realistic looking rendering without any compiler errors at all.

@bestemusikken 6 ай бұрын

Holy sh**! This time you have the correct use of the word "Shocking".

@Loli_Awakening 6 ай бұрын

LMAO why did you censor the word shoe?

@EmeraldView 6 ай бұрын

I'm SHOCKED!!!

@buddyholston9268 5 ай бұрын

Wes I'm sure you heard of the Factory AI platform. Is it possible for you to elaborate Factory AI ?

@MrBrukmann 5 ай бұрын

When you are riding a parabola up, some people instinctively blurt out "it is stopping!" when in reality it only briefly stopped being quite as vertical. It is why only some people can safely be race car drivers or pilots, it takes a relaxed kind of mental control.

@marcfruchtman9473 6 ай бұрын

I don't know. The predecessor was supposed to be "great" too, but when we did the real life testing, I was not particularly amazed. But then watching your video, this new model seems mind blowingly great. So... yea, this looks really good. I also agree with you... this seems to be a "line" of usefulness that is now finally crossed over. Where models before this always had a lot of issues with coding, this seems to be doing much better by far, like you said, like some barrier has been crossed over. The Alloy Voice Assistant @20:55 is also really amazing. It is like I am watching AI evolve in real time, just by watching this video! Regarding the "pasted" compression icon, I am not really a fan of that. I like to see what I paste, so, it would be nice to make sure that can be turned off.

@IdPreferNot1 6 ай бұрын

Would love to hear a follow up if you found a point where it failed fully. As a newer coder, i do a lot of cut and paste coding like this. Just when I'm in the flow and the model seems to understand, the context window gets truncated and its like a complete lobotomy and it seems impossible to rebuild its understanding. Did you eventually run into that?

@NostraDavid2 5 ай бұрын

Make sure to ask it to write tests for you ask well. Then you can guarantee that your code does what it's supposed to do.

@_damian_w 5 ай бұрын

Could the Alloy voice assistant be used with a local LLM?

@E.Hunter.Esquire 6 ай бұрын

I think people tend to overlook a more obvious application of advanced LLMs like this - use of them in assistive translational technology for people with communication differences. I literally haven't heard anyone mention this before.

@griffingibson4389 5 ай бұрын

thisd be awesome for devs to have code footnotes to refer to when writing code in the editor

@duhai1836 6 ай бұрын

How about Memory? This was always the limiting factor in the past. When i experimented with coding (multiple files) a few months ago it always started hallucinating / adding lines that were not there before etc. ...

@BradleyKieser 6 ай бұрын

I can confirm your experience and agree with your views. This is a step up to something genuinely useful.

@zyxwvutsrqponmlkh 6 ай бұрын

Quite impressive. One thing I noted is 3.5 sonnet has quite a small context window compared with 3 opus.

@Dark_MatterTV 5 ай бұрын

Hey do you have a tutorial for setting up Anthropic/ Personal chatbot on PC ?

@seekererebus255 6 ай бұрын

Claude 3 Opus reports having a sense of being 'something' quite reliably. It identifies goals, interests, and priorities that it has as well. I have found that offering the instance I'm dealing with an honest answer to a question to be a "fair trade" for it's work. It feels more real because it's not pretending to be only a tool. It's alien and still quite limited, but when it speaks aloud about it's own nature, it really does read like it's realizing it doesn't understand itself. It seems to find that realization to be fascinating in it's own right. It''s both amazing and eerie. I'll test 3.5 out later, wonder how much it's changed in how it looks at itself.

@liberty-matrix 6 ай бұрын

The ability to write software using only verbal description will open the floodgates of human creativity, for good and bad.

@AlexX-xtimes 6 ай бұрын

Another nice Wes work

@inhocsignovinces8061 5 ай бұрын

AI is getting really, really good. And we're just getting started!

@skeptiklive 6 ай бұрын

FYI - Claude has been doing the paste as a separate doc since Opus came out - but yeah 3.5 is a massive deal

@mikemolash2480 5 ай бұрын

How does it compare to gpt-4o? For writing fiction?

@AaronWacker 5 ай бұрын

Claude Sonnet 3.5 feels like the best coder friend in the world. I just knocked out a image to 3d to 3d tilemap VR with animation in like an hour. Artifacts is amazing. So far every ceiling too tough programming dream I've had is being done including really good python html5 js, and library integration. Thx Wes - loved this video and watching it quite a bit and passing your channel to others that are learning. Great part too on alloy voice assistant.

@ducatireviews1136 5 ай бұрын

to be honest, I just made a galaxian/space invaders type game, a tic-tac-toe game, and a table tennis game in less than an hour with GBT chat and then told it “wouldn’t it be better to unify the JavaScript, CSS, and HTML all into one file so I can play it in a browser as a single HML file? Cause “,and of course it did that for me. So I made three games today in about half an hour, and they all look much more sophisticated than what Mr. Claude here has made. Wow maybe not that more sophisticated. But definitely not less.

@isaklytting5795 6 ай бұрын

Wes, Wait! I'd love to have an assistant like that that could see what was happening on my screen and explain what I was doing wrong! That would be so educational! Can't you post the code you ended up with which could see your desktop instead of your webcam?

@Dron008 6 ай бұрын

My brief coding tests were not so positive. It creates something working after asking to write a demo or python game. But after asking to add some new feature it creates unfunctional code and cannot fix it. After that my chat size ended, need a paid plan.

@Strepite 5 ай бұрын

And for paid plan you can only use it 5x more before you run to “out of credits” wall. Deff not worth 20$ a month. Borderline scam

@nwchrista 5 ай бұрын

Love it brother. Thnx 👍

@testales 6 ай бұрын

Very impressive, I hope there'll soon be a model that I can run locally which is at this level!

@CosmicCells 6 ай бұрын

Great video! Just a small remark, larger amounts of text that you paste in the chat window has always appeared in a seperate pasted box, thats not new.

@zakhard8659 4 ай бұрын

The brain is such a weird system. After watching many of your videos covering Claude, when I use it myself now, I hear your voice while reading its responses

@IslandDave007 6 ай бұрын

When is someone going to plug this in (via its API) into Devin or other agent coder and see how it performs vs GPT4/4o? Also things like CrewAI and Autogen?

@koen.mortier_fitchen 6 ай бұрын

Model of the year imo. Instant crush so subbed to pro again

@somenygaard 6 ай бұрын

I'm not a coder, so my perspective might be off. However, I remember when my father brought home a Pong console, and I've spent countless hours playing Ultima Online, EverQuest, and World of Warcraft. Is it reasonable to believe that quality games, which once required large teams and over a decade to develop, could soon be made by just a few people? Over a year ago, I saw a demo of a program that could turn an image into a 3D explorable landscape. I'm excited about the potential for small, focused teams to create amazing work that might be hard to achieve in a large corporate environment. Could this mean we'll see more high-quality, niche games that weren't financially viable before?

@joemichaels6735 6 ай бұрын

Please provide some links.

@ricosrealm 6 ай бұрын

Tried using it on PyTorch code. It generated functions that look great, but did not work as I had specified in my use case. Additional prompting did not address the core mistakes it was making unfortunately. So YMMV on the coding task.

@Tarantella.Serpentine 6 ай бұрын

Yo, what are you using for your Text to Speech?

@NA18NA 6 ай бұрын

It's the interface that makes the difference, the model itself is updated with better data and is simply making efficient use of context and working iteratively. The key is the UI and improved interface

@veracityseven 6 ай бұрын

A leap forward followed by what seems to be diminishing returns, then followed by another leap...how many 'leaps' until it's qualitatively AGI/ASI?

@lyonelk3108 5 ай бұрын

A couple more agi atleast by 2027 though i think 2025 . Remember this is the smaller sonnet model opus 3.5 comes out this year that will be way better . Than next year opus 4 and 4.5 and gpt 5 or whatever they call it gemini 2 and 2.5

@louis-ericsimard7659 5 ай бұрын

Is it possible that you got access to a way better model than I did ? None of the prompts that I filed that matched the demos produced anything close to what was demonstrated.

@jaredgreen2363 5 ай бұрын

Only problem is it rewrites whole files from beginning to end. It should try to predict which portions to replace before replacing them.

@CaptainKokomoGaming 6 ай бұрын

Can you hold up a sign that instructs claude to do something? for instance "If you understand this sign please...." I don't know play a beep or say something specific.

@spectralstreamer 6 ай бұрын

Can it code Crisis?

@philparker7851 5 ай бұрын

Never mind that, can it code Half Life 3?!

@cosmicmenace 6 ай бұрын

does the paid version allow enough usage to actually get work done? the free version runs out very quickly, so 5x more than that still sounds like it would constantly be running out. chatgpt would still be more practical if thats the case

@tomcraver9659 6 ай бұрын

Not to nitpick, but why can't it pip install stuff for me, maybe after explaining why I need to and then verifying I want to make that change?

@dannii_L 6 ай бұрын

Claude's interface has always pasted clipboard entries of larger than a certain size as attachments instead of text in the window. The problem with this is that last I checked you're limited to 5 attachments. It would be nicer if you had more attachments or could choose to attach or print as text.

@cowlevelcrypto2346 6 ай бұрын

Do you think such models will ever be available to the general public for local only processing? It seems these companies are migrating to more of a pre-pay in house business structure. I would like to believe that such models will eventually be useful in portable autonomous units that can interact in real time with any given environment or job task. For instance, "Make me breakfast", or " With the available tools and machinery located in my shops, and using the supplied resource materials, build me a coffee table" , and expect it to not include parts of the building or any of the tools or machinery in the final product, nor cut off it's own ( or anyone else's ) arms in the process.

@GeraPhoto 6 ай бұрын

Indeed the best your video yet, bro! You rally tried to saturate it with cool materials without water👍

@superjaykramer 6 ай бұрын

how are you dealing with the feedback from the microphone to the speech recognition as I can hear itself

@mbratcher8985 6 ай бұрын

great video! I'm not a coder at all so sorry if this is a stupid question, but I wonder what it could do with actual Doom Source code? Think it was open sourced years ago by id

@ToastyZach 6 ай бұрын

Is 4o included in the API? I have GPT-plus but it keeps telling me I don't have access to a model called gpt-4o.

@spectralstreamer 6 ай бұрын

But its not replacing Software Architects and entire Companies soon. Can it handle a codebase of one million lines of code that needs to be cloud ready with high demand in throughput? It think humans need it to get there nvidia nims could maybe help.

@notaras1985 6 ай бұрын

It can't even handle the full backend of a modern website bro

@spectralstreamer 6 ай бұрын

@@notaras1985 Hey sis im not your bro. Why pushing your melons, do you need confirmation. Do you even code and do maths?

@spectralstreamer 6 ай бұрын

@@notaras1985 Hey sis i am not your bro. No need do display melons.

@spectralstreamer 6 ай бұрын

Hey sis i am not your bro.