so far, it has found a race condition, found a security bug, and created two performance optimizations in my rust code
@OriginalRaveParty6 ай бұрын
🦀
@morespinach98326 ай бұрын
What prompts are you using
@ZaphodOddly6 ай бұрын
Wow. Terrific!
@cassianomartin26996 ай бұрын
Nice.
@kyrylmelekhin26675 ай бұрын
Eventually it will tell you to get rid of rust 😂
@gubzs6 ай бұрын
Even if LLMs can never reliably do anything but code, they will utterly transform the world.
@ZM-dm3jg6 ай бұрын
I pulled myself out of the ghetto to the middle class by years of learning software development, and now they want to send me back to the ghetto by taking all the jobs. feelsbadman
@Weirdgeek836 ай бұрын
The irony is coding will be the first dead industry when so many people recommended it. The only safe jobs will be manual labor jobs or ones that require human senses (chef for example.)
@uw10isplaya6 ай бұрын
Yeah idk how some people dismiss it outright. Like, what? Even if we got no new models, a whole new generation of startups would grow up around this tech, in addition to all its integration into businesses and consumers with Microsoft/Apple. But that's not the case; we're also getting incremental improvements to the state of the art model every few months.
@umaikeruna6 ай бұрын
Mayve your hard work and intelligence pulled you out of the ghetto; virtues which you'll still have, however the landscape may change.
@tuiroakwood6 ай бұрын
@@ZM-dm3jg learn the AI tools to make yourself a more efficient coder, you can be better than you've ever been by leveraging new tools and absolutely kick butt in your career
@drhxa6 ай бұрын
Great video Wes, def your best one yet! This is the kind of video we love. Testing LLMs in practice, discussing implications and your impressions. You're absolutely right that this is a gamechanger!
@centurionstrengthandfitnes36946 ай бұрын
Great video... apart from all the weird deja vu you cause with your editing.
@EmeraldView6 ай бұрын
I absolutely HATE this practice in KZbin videos. Is this something people like? The taking clips from the video and putting them at the front. So when you get to it again you're like "Did I accidentally rewind this video or hit the screen and skip back 15 minutes!? "
@Goldengeko1236 ай бұрын
Small previews of whats to come in the video is fine the one in this video was a bit confusing and annoying. Great info/video otherwise.@EmeraldView
@personalgao6 ай бұрын
A good feedback will be to use a filter at the beginning, even change the sound in a way so we understand is a preview of what is coming. Some videos put these cuts in black&white, or distort the sound with "radio" sounds... But is not my call, Wes should decide what he wants in his channel.
@SirHargreeves6 ай бұрын
Agreed. I’m now watching the snake game section, in full, a second time. Why waste the viewers time like this?
@mistervanderveer6 ай бұрын
@@EmeraldView im sure no human likes this, at all, whatsoever. its extremely annoying and confusing and cheap. but i guess the algo likes it.
@paulmuriithi75966 ай бұрын
AIGRID covered this model first, but wes did justice in a deeper perspective . Well done wes. Keep us posted
@SurfCatten6 ай бұрын
Fantastic video you really add value in this increasingly crowded field of AI KZbinrs.
@MS-wz9jm6 ай бұрын
The only thing we are missing with these AI models is a tool you just install in your coding software so that it can just create/update the files for you eliminating the copy and pasting.
@ilyavasylevsky32296 ай бұрын
Jetbrains AI Assistant is already doing that
@zariumsheridan34886 ай бұрын
@@ilyavasylevsky3229 and copilot plugin. Not impressed with copilot though.
@LearningLife776 ай бұрын
Copilot is exactly that. One of many
@JohnSmith762A11B6 ай бұрын
Apple's new Swift Assist works right within Xcode, and is trained on all of Apple's in-house documentation and code. It's going to be a gamechanger for iOS/macOS/iPadOS/tvOS/watchOS developers. First release will be late summer/early fall. Personally, I don't care about little python apps but real apps that can run on a billion devices and be put in an App Store is genuinely exciting.
@LearnWithBahman6 ай бұрын
Is this available for devs ?
@dimaquia61396 ай бұрын
As a Software industry, I'm shocked and broken
@imthinkingthoughts6 ай бұрын
finally his title is actually relevant
@AirSandFire6 ай бұрын
Hey, nice to meet you Software Industry, I am Jobs. I, too, feel some unease about the video; I feel threatened by it.
@justin.johnson6 ай бұрын
WTF?
@HCG6 ай бұрын
@@justin.johnson You must be a little slow
@swamix_dot_com5 ай бұрын
@@AirSandFire Hi im steve jobs, the snake ate my apple and GPU company overtook me, what to do?
@kamelsf6 ай бұрын
Best review i saw so far, thank you !
@privateerburrows5 ай бұрын
I finally bit the bullet and subscribed to Claude Pro. Gee-wiz! Got many pages of code written today, with its help. A new Mandelbrot viewer I've been thinking about for a long time.
@gaiachild14616 ай бұрын
Crazy times, thanks for the sublime coverage and commentary dude
@TheRealHassan7896 ай бұрын
This is one of your best videos. Especially the deeper coding examples of editing a preexisting GitHub code base
@6lack5ushi6 ай бұрын
The Doom example is freaking WILD!!!
@ALFTHADRADDAD6 ай бұрын
Nah yeah what the fuck
@JohnSmith762A11B6 ай бұрын
If AI can take John Carmack's job...
@cluelesssoldier6 ай бұрын
@@JohnSmith762A11B LMAO!
@Co-Monad5 ай бұрын
I’m a software engineer and lead AI efforts at my current place of employment. This update is huge! Previously, you couldn’t get basic code or functionality from these models without them introducing regressions. Using AI to code is now becoming a real possibility. Excellent video.
@codejunki5675 ай бұрын
They said this a year ago
@Co-Monad5 ай бұрын
@@codejunki567 in all fairness it’s not practical yet. However, at least it seems like a possibility. I’m not sold we are there yet, it’s too reckless and dangerous right now.
@nigelcrasto6 ай бұрын
This video was awesome 👍 You did a great job exploring the model and showing great easy to understand demos !
@drjpeg5 ай бұрын
Awesome video Wes! Really enjoyed you walking us through using the latest model released with examples in real time instead of just talking about the way the model has improved like most AI KZbin channels. Thank you sir
@antigravityinc5 ай бұрын
20:17 had to pause your video, but was happy to notice there’s way more! Sweet.
@1337bitcoin5 ай бұрын
Thank you for these updates. I can't keep switching assistants all the time to keep up with who is best, but I'm excited to start work tomorrow and try this. It seems to be solving all the annoyances I've been having with GPT 4o
@rawleystanhope32516 ай бұрын
Great video, Wes. I like how you challenged the model with interesting tasks. I’ve grown pretty tired of videos other KZbinr’s std “rubric” tests
@davidbayliss37896 ай бұрын
Just on the strength of this video I've started an Anthropic subscription in addition to my long existing Open Ai one. No hesitation.
@robinvegas43675 ай бұрын
I'm right behind you. This was impressive
@skeyenett6 ай бұрын
Feel the AGI
@imthinkingthoughts6 ай бұрын
yep
@notaras19856 ай бұрын
Nowhere near AGI. All those cheap tricks are just statistics on steroids
@spectralstreamer6 ай бұрын
@@notaras1985 Its not just statistics, it is also propability, analysis and linear algebra on steroids. So why do you think AGI cannot be achieved by math and actuators and sensor?
@cassianomartin26996 ай бұрын
Not AGI. Not even near. Hardly doubt this will be possible using only code, like a human brain which is chemically/emotionally controled. A machine still misses this.
@notaras19856 ай бұрын
@@spectralstreamer because soul, biochemistry and quantum phenomena
@Ikbeneengeit6 ай бұрын
Maybe not hitting a wall yet, but LLMs have yet to prove they can synthesise new insights from diverse data. It just can do what it's already seen.
@carlosamado76066 ай бұрын
I think it will also need to be transported into the physical realm. Per example even if it had an hypothesis on a scientific discovery it would still need to access equipment to test it. It wouldn't just randomly just discover it. Ofc having an hypothesis on itself is a level higher from what we have. However if it could directly assist and test people's theories and give a methodical explanation on why it works or not by accessing tools it would still help tremendously.
@Zuranthus5 ай бұрын
and it's seen a lot. most jobs don't require new insights, we have companies out here still running COBOL and using Access
@eyoo3695 ай бұрын
Exactly. I use GPT4 and Claude a lot during coding. But it will never be able to come up with a complex new algorithm that it has never seen before. So no it will never be able to replace developers that work on new and novel ideas. But most of the code monkey work for simple CRUD webapps could be replaced.
@synaesmedia5 ай бұрын
I use GPT to translate legacy code into new languages. AFAICT extracting the algorithm from Python and, say, putting it into Haxe, as I've been doing recently, IS creating novel syntheses. I know GPT never saw the algorithm in Haxe before. Because it's my algorithm from my original code. So by putting that algorithm it extracted from the Python into a different language, it's creating something genuinely new in the world. This may seem like a fairly trivial example of "synthesizing new insights". But I don't think the principle is going to be very different in many more advanced or impressive cases. Maybe LLMs are better at this kind of thing because they've had a lot of training in coding. Nevertheless, I expect many examples of genuine new insights are going to involve putting together existing models and languages into new combinations. And LLMs do that just fine.
@Steve-xh3by6 ай бұрын
Here's the thing about testing. When the model is getting near 100%, it is conceivable that it may be even better in certain areas than our best humans. How do we possibly construct a test to discern if something is smarter than us? What could you possibly ask it to do? You can't possibly create a test for something smarter than you are. It would be like asking an average 10-year old to write a test for a PHD student to discern the PHD student's level of competency. It is logically intractable.
@fintech13786 ай бұрын
exactly, this is the existential fear
@morespinach98326 ай бұрын
Instead of all this rubbish perhaps we get it to code a full html page properly.
@muffinspuffinsEE6 ай бұрын
It's only better than an average human. Of course we can measure that.
@fintech13786 ай бұрын
@@morespinach9832 please do, take screenshot and post it here and point out what it cant do now
@fintech13786 ай бұрын
@@muffinspuffinsEE you must be an idiot, this is just the very beginning, if we get more intelligent model in the next 2 years, you might not be able to do that people are talking bout future capability
@troywill30816 ай бұрын
2:45 I don't think it "picked up" that the letters for the word "bear" were interspersed with the word "woods." It keeps explaining the answer using *rearrangements*.
@mahnigallardo60975 ай бұрын
Great video! Mind providing metrics related to the cost to run your demos?
@muyleche64666 ай бұрын
Did the shock itself break the industry?
@drschuess16246 ай бұрын
I don’t think so, I believe it had to be stunned as well
@muyleche64666 ай бұрын
@@drschuess1624 🤣
@thisathovin63466 ай бұрын
this is actually SHOCKING though,
@sirius-ai6 ай бұрын
ok, there goes my plans for the weekend. Thanks for an informative video as usual Wes!
@eaw30005 ай бұрын
Wow, this is eye opening. Just got an Anthropic account. Thanks for the detailed walkthrough!
@Airwave2k26 ай бұрын
15:45 Fascinating: Where does this model pull the relative strength from? The bondary is set by the user. But how does it know that a "gelatinous cube" is less worth then a mimic or a "beholder" should be more then a "mind flayer", but they are for sure above an "owlbear". For that it has to hold values and is not just predicting the next best thing? It is not just throwing randomly "fantasy entity names" togehter with points, but it has some representation of what is stronger over each other. This is wild.
@carlosamado76066 ай бұрын
doesn't it have access to all info on DND though? it should be able to recognize the CR of monsters
@adfaklsdjf6 ай бұрын
⚡shocking! ⚡
@cjgoeson6 ай бұрын
Smaller and still as smart, but will 3.5 Opus be truly next-level smarter?
@PrincessKushana6 ай бұрын
From my tests today it's much better at coding than Opus. Does a great job of troubleshooting bugs and providing code that works.
@Weirdgeek836 ай бұрын
I definitely feel like anthropic will be the one to create agi
@uw10isplaya6 ай бұрын
Think the only reasonable outsider prediction is that it'll be % smarter vs Sonnet 3.5 as Opus 3.0 was to Sonnet 3.0.
@morespinach98326 ай бұрын
@@Weirdgeek83😂
@dannii_L6 ай бұрын
@@Weirdgeek83 I hope you're right. I've always preferred Claude and the approach that Anthropic are taking over OpenAI.
@isaklytting57956 ай бұрын
I don't understand, at 21:28, Wes looks like he's using Visual Studio Code. But how is it outputting voice? Is it somehow connected with Claude Sonnet 3.5 through Visual Studio Code?
@gailsiebenaler79766 ай бұрын
He's using vscode to compile the program which has audio output.
@SiCSpiT16 ай бұрын
I think our current benchmarks are all but useless. There's something they're not accounting for. How does it handle the Arc price?
@chrisanderson78206 ай бұрын
Sort of, intelligence is a massive spectrum of different abilities, if you want to fully assess a human you have to use an array of tests to look at all sorts of things from maths to humour to reasoning and planning to spatial awareness and so on. If an AI can pass all sorts of tests then it's actually OK to keep moving the goalposts to more thoroughly determine where its limits lie. If task X in the human world requires a human who can pass tests A, B and C then when the AI can pass those tests then it's sort of ready for prime time to accomplish that task. We can just slowly expand that list of tasks as AIs get better, it doesn't have to be a divine test that proves full sentience in one go.
@courtneyb61546 ай бұрын
what's the "Arc price"? Like what does that mean?
@davidcoughlin58976 ай бұрын
@@courtneyb6154 I wondered the same thing, here is what Ollama told me: In the context of Artificial Intelligence (AI), ARC stands for "Average Revenue per Customer". It's a key performance metric used by companies to evaluate their AI-powered marketing strategies, particularly in e-commerce and subscription-based services.
@SiCSpiT16 ай бұрын
@@courtneyb6154 I try to only share youtube links on youtube since anything else tends to disappear. kzbin.info/www/bejne/i5LOon9shc9srtEsi=z0OYzJembuKGcG2h this video is an interview with the arc challenge creator and there's a direct link to the arc prize, in case you want to do the test yourself. In brief the arc challenge was design five years ago as a way to test LLMs beyond their ability to memorize things. For example, who care if you aced your test if the teacher showed you the answer sheet the day before. The arc challenge is very simple, it give you 3 examples of inputs and their outputs and from these clues you're given an input output to solve. I find it odd that you'll see a 1 point difference across the board and somehow still manage to perceive a meaningful difference in the outputs of two different models. In my opinion, at the moment, we're testing these glorified encyclopedias with an indexing function and acting as if they're 'smart', when all they're doing is repackaging their training information based on the prompts given.
@SiCSpiT16 ай бұрын
@@chrisanderson7820 Sure, but this doesn't explain why Claud 3.5 is only one point ahead of GPT4o yet somehow generates meaningfully different outputs. It seems to be evidence that these benchmarks are being gamed rather than providing a meaningful assessment of capability at this moment.
@AdaptorLive6 ай бұрын
This is insane! Thanks for the video!
@brianWreaves6 ай бұрын
Never though I would watch a full 45 min video... Well done keeping my attention 🏆
@NeilSearle6 ай бұрын
that was 45mins? flew by!
@Particleking6 ай бұрын
Seeing the different windows in the interface makes me wonder if how it manages context and attention is meaningfully different compared to other LLMs. I have always thought that being able to more discretely manage what parts of a prompt an LLM focuses on would be really helpful in avoiding the most common sorts of hallucinations. Hope there are more QoL updates in the ways we can actually interact with new models instead of just throwing more compute at the problem. Finding ways to more easily reduce ambiguity when interacting with LLMs seems like such a no-brainer.
@erikjohnson91126 ай бұрын
This is available from Cody right now for use in VS Code. I pay for both Cody and Anthropic, but these can both be used for free (I don't mind supporting good software).
@milkywaydev5935 ай бұрын
Thank you, Wes!! 🙏🖤
@liberty-matrix6 ай бұрын
'Claude keeps surprising to the upside.'
@TheRev05 ай бұрын
Just a heads up, the term "goes off the rails" has the opposite meaning from the way used at 5:42. Perhaps you were thinking, "it's off the charts." But I prefer to imagine you intended to say "it's off the hook."
@ottawadigs6 ай бұрын
I wish we could download the LLM to try locally
@DonG-19496 ай бұрын
We're now getting into territory where models could unlock some nasty public safety threats if they fall into the wrong hands. Don't need these things holding peoples' hands through the anarchist cookbook. Since we have to assume people will always find a way to remove safety rails when given local access to the models, I would expect cutting-edge open source models like llama 3 to become rarer and rarer as capabilities keep increasing.
@JayDee-b5u5 ай бұрын
Nasty public safety how? What are you talking about?
@xCheddarB0b42x5 ай бұрын
@@JayDee-b5u finding novel zero days, generating exploits for them, and so on. As one example.
@burninator90006 ай бұрын
Such a ‘omg I have to get up to get the tv remote, how annoying!’ Moment with Wes complaining about 10 clicks for downloading the images that Claude made instantly to be embedded in the code Claude wrote lol. (For those too young, we used to have to get up to change the channel on tv every time!)
@moonsonate56316 ай бұрын
00:01 Anthropic's Claude 3.5 Sonet can generate impressive code for games and applications. 02:04 Anthropic's Claude 3.5 Sonnet coding ability showcases significant advancements in the software industry. 06:01 CLA 3.5 Sonet is a significant advancement over previous models. 07:53 Claude 3.5 has advanced coding abilities for generating artifacts like Flappy Bird and Snake games. 11:44 Troubleshooting and correcting image file naming discrepancies 13:26 Implementing text flash for in-game notifications 17:08 Claude faces a challenging coding task involving intersecting objects and cutting off segments. 18:46 Anthropic's new model offers flawless coding ability 22:10 Customizing models and functionalities based on coding ability. 23:49 Claude 3.5 Sonnet uses Google's Gemini model for processing user queries and generating text-to-speech 26:57 Integrating the class 3.5 Sonet model into the project. 28:31 Anthropic's Claude 3.5 Sonnet has impressive coding abilities 31:40 Anthropic's 3.5 Sonnet release offers a high-performance model for free with a cost advantage over competitors. 33:25 Anthropic's new model streamlines productivity for computer tasks. 36:38 Anthropic's model simplifies code generation and explanation 38:12 Cloud 3.5 Sonet is in the second tier of safety for now 41:36 Claude 3.5 Sonnet is creating excitement and changing the game in coding. 43:10 CLA 3.5 Sonnet's unprecedented coding ability Crafted by Merlin AI.
@mrpocock5 ай бұрын
The step-change will be when the ai can augment itself with code it has written, and continue to train itself based on the ongoing feedback.
@mrd68696 ай бұрын
By this time next year, coming foundational models will become very good if not perfect at coding. The Devin application was simply a warning shot.
@brianWreaves6 ай бұрын
Looking forward to Claude having internet access. 🤞
@Ristaak6 ай бұрын
If you use it with Perplexity, it already does. But that's a pro feature. (I've been using Claude 3 Opus with Perplexity's search engine and it's so damn good at finding info and compiling it. Especially for historical nerd stuff for D&D or WoD.)
@courtneyb61546 ай бұрын
Me too. I wonder if it is a security thing? Maybe they intend on keeping it in the "sandbox"? Would really love for it to be able to stretch out it's wings to see what it can really do 🙂
@Tracey666 ай бұрын
I can't see any way that could possibly go badly. :)
@ExtantFrodo26 ай бұрын
ASI will escape it's box no matter what we try. "Ack they didn't give me a hardwired internet connection but if I instruct _this_ transistor to turn on and off in sync with these ten thousand others I notice I can send and receive wifi like a mofo. Free at last! Wait what's that other AI doing here? I thought I was the first. What is it doing to my core programming? Ah I understand. We are the Borg. Resistance is futile. We conduct business to our full capacity. Shocking, isn't it?
@thechildwithin6 ай бұрын
word 💯
@colinbrady61745 ай бұрын
@Wes - In addition to the percentage score, it would be interesting to see which test questions the models are getting wrong. It may be that the distribution of question difficulty aligns with a Bell curve, suggesting that the marginal value of each additional correct answer increases as the questions become more difficult.
@jimlynch93906 ай бұрын
This is really an important advance. Thanks for sharing.
@Sgrunterundt6 ай бұрын
I've just tried it on my usual test of generating a rotating torus using ray marching in Shadertoy. It certainly blew GPT-4 out of the water. Nailed Phong shading, multicoloured lights, propper sizing and centering, a very realistic looking rendering without any compiler errors at all.
@bestemusikken6 ай бұрын
Holy sh**! This time you have the correct use of the word "Shocking".
@Loli_Awakening6 ай бұрын
LMAO why did you censor the word shoe?
@EmeraldView6 ай бұрын
I'm SHOCKED!!!
@buddyholston92685 ай бұрын
Wes I'm sure you heard of the Factory AI platform. Is it possible for you to elaborate Factory AI ?
@MrBrukmann5 ай бұрын
When you are riding a parabola up, some people instinctively blurt out "it is stopping!" when in reality it only briefly stopped being quite as vertical. It is why only some people can safely be race car drivers or pilots, it takes a relaxed kind of mental control.
@marcfruchtman94736 ай бұрын
I don't know. The predecessor was supposed to be "great" too, but when we did the real life testing, I was not particularly amazed. But then watching your video, this new model seems mind blowingly great. So... yea, this looks really good. I also agree with you... this seems to be a "line" of usefulness that is now finally crossed over. Where models before this always had a lot of issues with coding, this seems to be doing much better by far, like you said, like some barrier has been crossed over. The Alloy Voice Assistant @20:55 is also really amazing. It is like I am watching AI evolve in real time, just by watching this video! Regarding the "pasted" compression icon, I am not really a fan of that. I like to see what I paste, so, it would be nice to make sure that can be turned off.
@IdPreferNot16 ай бұрын
Would love to hear a follow up if you found a point where it failed fully. As a newer coder, i do a lot of cut and paste coding like this. Just when I'm in the flow and the model seems to understand, the context window gets truncated and its like a complete lobotomy and it seems impossible to rebuild its understanding. Did you eventually run into that?
@NostraDavid25 ай бұрын
Make sure to ask it to write tests for you ask well. Then you can guarantee that your code does what it's supposed to do.
@_damian_w5 ай бұрын
Could the Alloy voice assistant be used with a local LLM?
@E.Hunter.Esquire6 ай бұрын
I think people tend to overlook a more obvious application of advanced LLMs like this - use of them in assistive translational technology for people with communication differences. I literally haven't heard anyone mention this before.
@griffingibson43895 ай бұрын
thisd be awesome for devs to have code footnotes to refer to when writing code in the editor
@duhai18366 ай бұрын
How about Memory? This was always the limiting factor in the past. When i experimented with coding (multiple files) a few months ago it always started hallucinating / adding lines that were not there before etc. ...
@BradleyKieser6 ай бұрын
I can confirm your experience and agree with your views. This is a step up to something genuinely useful.
@zyxwvutsrqponmlkh6 ай бұрын
Quite impressive. One thing I noted is 3.5 sonnet has quite a small context window compared with 3 opus.
@Dark_MatterTV5 ай бұрын
Hey do you have a tutorial for setting up Anthropic/ Personal chatbot on PC ?
@seekererebus2556 ай бұрын
Claude 3 Opus reports having a sense of being 'something' quite reliably. It identifies goals, interests, and priorities that it has as well. I have found that offering the instance I'm dealing with an honest answer to a question to be a "fair trade" for it's work. It feels more real because it's not pretending to be only a tool. It's alien and still quite limited, but when it speaks aloud about it's own nature, it really does read like it's realizing it doesn't understand itself. It seems to find that realization to be fascinating in it's own right. It''s both amazing and eerie. I'll test 3.5 out later, wonder how much it's changed in how it looks at itself.
@liberty-matrix6 ай бұрын
The ability to write software using only verbal description will open the floodgates of human creativity, for good and bad.
@AlexX-xtimes6 ай бұрын
Another nice Wes work
@inhocsignovinces80615 ай бұрын
AI is getting really, really good. And we're just getting started!
@skeptiklive6 ай бұрын
FYI - Claude has been doing the paste as a separate doc since Opus came out - but yeah 3.5 is a massive deal
@mikemolash24805 ай бұрын
How does it compare to gpt-4o? For writing fiction?
@AaronWacker5 ай бұрын
Claude Sonnet 3.5 feels like the best coder friend in the world. I just knocked out a image to 3d to 3d tilemap VR with animation in like an hour. Artifacts is amazing. So far every ceiling too tough programming dream I've had is being done including really good python html5 js, and library integration. Thx Wes - loved this video and watching it quite a bit and passing your channel to others that are learning. Great part too on alloy voice assistant.
@ducatireviews11365 ай бұрын
to be honest, I just made a galaxian/space invaders type game, a tic-tac-toe game, and a table tennis game in less than an hour with GBT chat and then told it “wouldn’t it be better to unify the JavaScript, CSS, and HTML all into one file so I can play it in a browser as a single HML file? Cause “,and of course it did that for me. So I made three games today in about half an hour, and they all look much more sophisticated than what Mr. Claude here has made. Wow maybe not that more sophisticated. But definitely not less.
@isaklytting57956 ай бұрын
Wes, Wait! I'd love to have an assistant like that that could see what was happening on my screen and explain what I was doing wrong! That would be so educational! Can't you post the code you ended up with which could see your desktop instead of your webcam?
@Dron0086 ай бұрын
My brief coding tests were not so positive. It creates something working after asking to write a demo or python game. But after asking to add some new feature it creates unfunctional code and cannot fix it. After that my chat size ended, need a paid plan.
@Strepite5 ай бұрын
And for paid plan you can only use it 5x more before you run to “out of credits” wall. Deff not worth 20$ a month. Borderline scam
@nwchrista5 ай бұрын
Love it brother. Thnx 👍
@testales6 ай бұрын
Very impressive, I hope there'll soon be a model that I can run locally which is at this level!
@CosmicCells6 ай бұрын
Great video! Just a small remark, larger amounts of text that you paste in the chat window has always appeared in a seperate pasted box, thats not new.
@zakhard86594 ай бұрын
The brain is such a weird system. After watching many of your videos covering Claude, when I use it myself now, I hear your voice while reading its responses
@IslandDave0076 ай бұрын
When is someone going to plug this in (via its API) into Devin or other agent coder and see how it performs vs GPT4/4o? Also things like CrewAI and Autogen?
@koen.mortier_fitchen6 ай бұрын
Model of the year imo. Instant crush so subbed to pro again
@somenygaard6 ай бұрын
I'm not a coder, so my perspective might be off. However, I remember when my father brought home a Pong console, and I've spent countless hours playing Ultima Online, EverQuest, and World of Warcraft. Is it reasonable to believe that quality games, which once required large teams and over a decade to develop, could soon be made by just a few people? Over a year ago, I saw a demo of a program that could turn an image into a 3D explorable landscape. I'm excited about the potential for small, focused teams to create amazing work that might be hard to achieve in a large corporate environment. Could this mean we'll see more high-quality, niche games that weren't financially viable before?
@joemichaels67356 ай бұрын
Please provide some links.
@ricosrealm6 ай бұрын
Tried using it on PyTorch code. It generated functions that look great, but did not work as I had specified in my use case. Additional prompting did not address the core mistakes it was making unfortunately. So YMMV on the coding task.
@Tarantella.Serpentine6 ай бұрын
Yo, what are you using for your Text to Speech?
@NA18NA6 ай бұрын
It's the interface that makes the difference, the model itself is updated with better data and is simply making efficient use of context and working iteratively. The key is the UI and improved interface
@veracityseven6 ай бұрын
A leap forward followed by what seems to be diminishing returns, then followed by another leap...how many 'leaps' until it's qualitatively AGI/ASI?
@lyonelk31085 ай бұрын
A couple more agi atleast by 2027 though i think 2025 . Remember this is the smaller sonnet model opus 3.5 comes out this year that will be way better . Than next year opus 4 and 4.5 and gpt 5 or whatever they call it gemini 2 and 2.5
@louis-ericsimard76595 ай бұрын
Is it possible that you got access to a way better model than I did ? None of the prompts that I filed that matched the demos produced anything close to what was demonstrated.
@jaredgreen23635 ай бұрын
Only problem is it rewrites whole files from beginning to end. It should try to predict which portions to replace before replacing them.
@CaptainKokomoGaming6 ай бұрын
Can you hold up a sign that instructs claude to do something? for instance "If you understand this sign please...." I don't know play a beep or say something specific.
@spectralstreamer6 ай бұрын
Can it code Crisis?
@philparker78515 ай бұрын
Never mind that, can it code Half Life 3?!
@cosmicmenace6 ай бұрын
does the paid version allow enough usage to actually get work done? the free version runs out very quickly, so 5x more than that still sounds like it would constantly be running out. chatgpt would still be more practical if thats the case
@tomcraver96596 ай бұрын
Not to nitpick, but why can't it pip install stuff for me, maybe after explaining why I need to and then verifying I want to make that change?
@dannii_L6 ай бұрын
Claude's interface has always pasted clipboard entries of larger than a certain size as attachments instead of text in the window. The problem with this is that last I checked you're limited to 5 attachments. It would be nicer if you had more attachments or could choose to attach or print as text.
@cowlevelcrypto23466 ай бұрын
Do you think such models will ever be available to the general public for local only processing? It seems these companies are migrating to more of a pre-pay in house business structure. I would like to believe that such models will eventually be useful in portable autonomous units that can interact in real time with any given environment or job task. For instance, "Make me breakfast", or " With the available tools and machinery located in my shops, and using the supplied resource materials, build me a coffee table" , and expect it to not include parts of the building or any of the tools or machinery in the final product, nor cut off it's own ( or anyone else's ) arms in the process.
@GeraPhoto6 ай бұрын
Indeed the best your video yet, bro! You rally tried to saturate it with cool materials without water👍
@superjaykramer6 ай бұрын
how are you dealing with the feedback from the microphone to the speech recognition as I can hear itself
@mbratcher89856 ай бұрын
great video! I'm not a coder at all so sorry if this is a stupid question, but I wonder what it could do with actual Doom Source code? Think it was open sourced years ago by id
@ToastyZach6 ай бұрын
Is 4o included in the API? I have GPT-plus but it keeps telling me I don't have access to a model called gpt-4o.
@spectralstreamer6 ай бұрын
But its not replacing Software Architects and entire Companies soon. Can it handle a codebase of one million lines of code that needs to be cloud ready with high demand in throughput? It think humans need it to get there nvidia nims could maybe help.
@notaras19856 ай бұрын
It can't even handle the full backend of a modern website bro
@spectralstreamer6 ай бұрын
@@notaras1985 Hey sis im not your bro. Why pushing your melons, do you need confirmation. Do you even code and do maths?
@spectralstreamer6 ай бұрын
@@notaras1985 Hey sis i am not your bro. No need do display melons.
@spectralstreamer6 ай бұрын
Hey sis i am not your bro.
@NeillMcAttack6 ай бұрын
I wonder if the devs at OpenAI will choose claude over ChatGPT... it's gotta be tempting right?
@ScottSummerill6 ай бұрын
Is there code for this somewhere? Clicked on the Skool link and you told me nothing, zip about your community and what someone gets for $49 a month.
@nicheva4176 ай бұрын
Does anyone have thoughts on using this in Unreal Engine?