OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks

  Рет қаралды 1,196,464

Fireship

Fireship

Күн бұрын

Пікірлер: 2 800
@divineaghulor3887
@divineaghulor3887 4 күн бұрын
"A car won't take your job, but a horse driving a car will"...deep stuff man
@slipknot1943
@slipknot1943 4 күн бұрын
Need tshirt with this written on it
@davefellows
@davefellows 4 күн бұрын
yup, that one made my day
@gamebros87
@gamebros87 4 күн бұрын
Same 🤯
@CyanRooper
@CyanRooper 4 күн бұрын
Damn, I had no idea BoJack Horseman was an Uber driver.
@pneumonoultramicroscopicsi4065
@pneumonoultramicroscopicsi4065 4 күн бұрын
that didn't age well
@SpontaneouslyDeliberate
@SpontaneouslyDeliberate 4 күн бұрын
If my job was coding solutions to problems with rigorously-defined requirements, this would be concerning.
@nixielee
@nixielee 4 күн бұрын
If my job ever had a single rigorously-defined requirement, I would be happy
@abhishek-soni
@abhishek-soni 4 күн бұрын
🤣🤣
@GSBarlev
@GSBarlev 4 күн бұрын
People around me have been pushing "natural language code gen" for a while now in the data analysis space, to which I say-anyone who can execute a clear and unambiguous data ask using _natural language_ more efficiently than they can construct the ideal SQL query or DataFrame op is a savant, of one form or another.
@josk8936
@josk8936 4 күн бұрын
I want to see how future pro ai managers, that fired all the developers, do when the client tells them the app just stopped working without other details and they have to find the error in the codebase with 20k lines of code that pass hundreds of states up and down the component tree like a seesaw
@Rugg-qk4pl
@Rugg-qk4pl 4 күн бұрын
That sounds like aerospace software development. I assure you they do not want AI code in their planes 😄
@arunkennedy9267
@arunkennedy9267 4 күн бұрын
I like how Turing test now is how many r's are there in Strawberry.
@saeidtafazzol3892
@saeidtafazzol3892 4 күн бұрын
lol
@esarmiento7
@esarmiento7 4 күн бұрын
hahahaha
@Gawroon7
@Gawroon7 4 күн бұрын
I have a friend who manages to say "strawberry" without using any of the "r" in it. This example shows that is also a philosophical issue.
@justaname999
@justaname999 4 күн бұрын
@@Gawroon7 I asked whether by "there are two Rs" chat GPT meant that there's only two phonemes of R. The reply was very off. something like "Yes, I mean actual graphemes. Even though the second R might be hard to perceive, there are still 2 Rs in the word "strawberry" in correctly spelled English" It's very funny.
@genghiskhan6688
@genghiskhan6688 4 күн бұрын
why is this task so hard anyway?
@last_fanboy_of_golb
@last_fanboy_of_golb 4 күн бұрын
PHD student here, the key to beat any LLM is to use a stick
@avg_user-dd2yb
@avg_user-dd2yb 2 күн бұрын
I'll beat you with that , you are useless now.
@roosterru
@roosterru Күн бұрын
Or a strawberry
@wesley6442
@wesley6442 Күн бұрын
Also, unplugging it from the wall socket xD
@avg_user-dd2yb
@avg_user-dd2yb Күн бұрын
@@last_fanboy_of_golb where to find this "stick" Is that some software?
@kindlin
@kindlin Күн бұрын
@@roosterru A strawberry on a stick. EDIT: Sorry, Strawbery.
@naeemulhoque1777
@naeemulhoque1777 4 күн бұрын
5:40 *"Ai won't take your job, but another man using Ai will.."*
@Monkeymario.
@Monkeymario. 4 күн бұрын
3-x
@rumfordc
@rumfordc 4 күн бұрын
another man with a decade of engineering experience, and a CS degree, using AI will* which is not too different to what was happening before AI. there's always been guys that are drastically faster than the average. the issue is that they're always rare and as tools and tasks become more complicated they become rarer.
@shipso6116
@shipso6116 4 күн бұрын
@@rumfordc yep, exactly. It's an eternal regularity and "using AI" is a coincidence here. They will win not because of "using AI", but because of being "at the top of their game", which *coincidentally* may now involve using AI, or may not. Different times different tools. May even find your own. Looking at the broad picture it's "staying ahead" what matters, not "using AI" per se. Those are not equal yet and hardly ever will be, at least for some parts of IT industry.
@moonwine7398
@moonwine7398 4 күн бұрын
​@@rumfordcthere will be day when AI will not need human for anything and it is coming within 5-6 years, so your quote HUMAN USING AI WILL REPLACE HUMAN WITHOUT AI which is a parrot quote repeated by many AI supporter is a blind and misleading quote. They are working to make AI more intelligent then human they don't need human intervention in AI
@rumfordc
@rumfordc 4 күн бұрын
@@moonwine7398 😆🤦‍♂ come back when you know what a quote is
@florduka
@florduka 4 күн бұрын
My HTML job is really gone now
@Yua_5
@Yua_5 4 күн бұрын
cry more😂
4 күн бұрын
Don't worry: no one knows how to do good HTML, neither the AI
@vasiovasio
@vasiovasio 4 күн бұрын
Front Page Express, Windows 98! 😊😊😊
@SamBrockmann
@SamBrockmann 4 күн бұрын
You're still coding in html? Oh, sh*t. 😂😂
@nicholasmaniccia1005
@nicholasmaniccia1005 4 күн бұрын
I've never been more unsure of a joke. Are you are saying it's easy to write proper HTML it's just no one does it. Or you think it is hard to write proper HTMl because everyone has their own opinion or something. Because it is really easy to write proper HTML just nobody does it because they don't see learning it or taking the time worth the effort for their genius brains.
@Trait74
@Trait74 4 күн бұрын
Thanks to fireship for almost giving me a heart attack at the beginning and then relieving me at the end lol
@bigboysdotcom745
@bigboysdotcom745 4 күн бұрын
That's literally his formula
@maxave7448
@maxave7448 4 күн бұрын
So, apparently this new million dollar idea from openai is just a self-proompter? Ironic how prompt "engineers" got replaced way before programmers ever could be
@w花b
@w花b 4 күн бұрын
​@@maxave7448 good.
@jhordanrojas9184
@jhordanrojas9184 4 күн бұрын
He's master that
@ethanfreeman1106
@ethanfreeman1106 4 күн бұрын
@@maxave7448 >prompt "engineers" got replaced hilarious how you pointed that out lol
@AwesomeDwarves
@AwesomeDwarves 4 күн бұрын
Most of my job as a software engineer is meetings, design, documentation, and watching Fireship. Sitting down to code probably only accounts for 20%. I'm either totally safe or I'm doing it wrong and I'm in imminent danger.
@AmandaVieiraMamaesouCult
@AmandaVieiraMamaesouCult 4 күн бұрын
I'm a data engineer. I spend more time talking to humans to figure out the requirements, quelling indecisive humans to create the requirements, translating the requirements into foundational/architectural decisions, clicking some stuff in whatever cloud tool I'm using and then, for a brief period of time, I code and maintain some intermediate level SQL in an 800-line query.
@callmeshen9754
@callmeshen9754 Күн бұрын
It's exactly how it should be, People just doesn't know how many projects companies (Mostly the big ones talking from experience) having so many projects on hold/delays. At very least for the next 5 years I guaranteed there is no need to panic, It will push more interns/juniors to certain projects they would've need been able to join beforehand. The question should be in that regards, What would happens in the far future if there won't be enough projects (Or the need for more)? It's less likely in the upcoming years but I'm sure it's very likely situation.. And there is a raise of CS degrees already so ye, There is a case here but at very least not in the near future.
@Buzmanm
@Buzmanm Күн бұрын
Your job isn't in danger, at least for now, it's juniors the ones that should be concerned, especially the ones graduating in 3 or 4 years. The barrier of entry has grown and will keep growing exponentially.
@RubenKelevra
@RubenKelevra 23 сағат бұрын
I'm pretty sure ChatGPT 4o is great at meetings. ;)
@Beknown107
@Beknown107 4 күн бұрын
O1 is a hilarious name for a program which has an exponential energy bill
@charfractal9441
@charfractal9441 Күн бұрын
LOL
@kindlin
@kindlin Күн бұрын
This comment section is next level.
@Ashash9877
@Ashash9877 4 күн бұрын
Call me when it can become a professional poker player or blackjack counter so I can make millions at Stake, or how about a pro stock trader or something? Why has no one used openAI for this yet? In the future OpenAI might run entire countries GDP systems💀 Welcome our overlords.
@HockeyMan666
@HockeyMan666 4 күн бұрын
LOL that probably exist already but you cant rly share that with the public can u?? use ur brain
@peyopeev8909
@peyopeev8909 4 күн бұрын
1.4k likes and nobody has mentioned that AI has been and it's used for both atm, you are for a wild ride pretty soon 😵‍💫
@bozydargroch9779
@bozydargroch9779 4 күн бұрын
@@peyopeev8909 yep. Botted likes?
@amaiaa8815
@amaiaa8815 4 күн бұрын
Been there done that
@TheBcoolGuy
@TheBcoolGuy 4 күн бұрын
"GDP systems"
@pandoraeeris7860
@pandoraeeris7860 4 күн бұрын
The cutting edge of Code Reports.
@perthecther__203
@perthecther__203 4 күн бұрын
EDGE
@vertas.y
@vertas.y 4 күн бұрын
@@perthecther__203 EDGE OR Chrome 😭😭😭😭😭😭😭😭😭😭😭😭😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😭😭😭😭😭😭😭😭😭😭😒😒😒😒😒😒😒😒🥰🥰🥰🥰🥰🥰🥰🥰☺☺☺☺☺☺☺☺😐😐😐😐😣😣😣😣😐😐🥳🥳🥳🥳🥳🥳🥳🥳
@CyanRooper
@CyanRooper 4 күн бұрын
OF
@Tozu25
@Tozu25 4 күн бұрын
The fact that everyone is forgetting for some reason is that AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society. Why should anyone be excited? We are witnessing the start of something really bad.
@Gregorythe5_5551st
@Gregorythe5_5551st 3 күн бұрын
​@@Tozu25 To be fair, if billions of people have nothing to lose i can't imagine companies can keep such a status quo going for long. I hope anyway.
@marc-io
@marc-io 4 күн бұрын
Impressive it can beat PhD students. But remember a PhD in breakdancing is not the same as being a breakdancer. This one could be called GPT-Raygun.
@SkegAudio
@SkegAudio 4 күн бұрын
😂 good one
@gabrielbarrantes6946
@gabrielbarrantes6946 4 күн бұрын
what exactly means it can "beat phd students"? I suspect is faster pretty well known problems that are well documented over the interned lol, so totally worthless.
@randomlettersqzkebkw
@randomlettersqzkebkw 4 күн бұрын
@@gabrielbarrantes6946 well, it can either mean beating them in a fist fight, or getting more correct answers than they can. Im not sure which one though🤔
@icaromendes1250
@icaromendes1250 4 күн бұрын
If AI had feelings it would definitely being hurt by this insult
@tainicon4639
@tainicon4639 4 күн бұрын
PhD students are also still learning. How does it compare to the pissed off post doc who’s been stuck in academia for 15 years after he graduated…
@romangeneral23
@romangeneral23 4 күн бұрын
It still can't count how many r's in strawberry. I think we good for a while...
@vasiovasio
@vasiovasio 4 күн бұрын
I too hope the Sarcasm hold us above the water... at least for a week or too! 😂😂😂
@itsdakideli755
@itsdakideli755 4 күн бұрын
It can...
@deep.space.12
@deep.space.12 4 күн бұрын
more likely a limitation from how the tokenizer breaks the word down (i.e. it's not aware of individual characters), than something fundamentally wrong with the model itself.
@hypno5690
@hypno5690 4 күн бұрын
there are two r's in strawberry though. There are also three r's and one r.
@jimmydesouza4375
@jimmydesouza4375 4 күн бұрын
How many r's are there in strawberrry though?
@ThisIsNotAUsername-v3o
@ThisIsNotAUsername-v3o 3 күн бұрын
0:19 - it is now 100% proven that English is the hardest subject.
@ThisIsNotAUsername-v3o
@ThisIsNotAUsername-v3o 3 күн бұрын
Also this is O(#); that is, the number of prompts until an AI that can't count letters properly thinks its answer is correct.
@sansithagalagama
@sansithagalagama 2 күн бұрын
​@@ThisIsNotAUsername-v3o Did you mean zero
2 күн бұрын
I think it’s pretty amazing they managed to build the equivalent of an all knowing but also friendly and helpful person on stackoverflow considering the lack of real training data.
@bengrzybowski2487
@bengrzybowski2487 4 күн бұрын
I've been seeing people freaking out about this new model, "it's better than PHD humans at X,Y,Z!" where X,Y,Z basically amounts to data processing... like oh my god??? A computer can process data faster than a person???? WHAT???? lmao
@mrbuttocks6772
@mrbuttocks6772 4 күн бұрын
Literally any modern computer can process data 'faster' than a human brain. Because a human brain is doing a whole bunch of shit at once in ADDITION to that data processing, while a computer does far less at any one time simply maintaining its 'active' state and therefor has more processing power to allocate for useful computation.
@deividfost
@deividfost 4 күн бұрын
Not surprising, since most people hyping AI have no idea what a PhD actually is.
@rosco3
@rosco3 4 күн бұрын
"It can beat programmers in olympics" Yeah if given unlimited amount of submissions, those same issues that are either ENTIRELY on the web or every single concept is on the web already, most of those olympics are for undergrad students
@tambal40
@tambal40 4 күн бұрын
​@@deividfostit doesn't matter it's evolving fast in 10 years it will be better than humans at everything EVERYTHING
@eagleeagle7360
@eagleeagle7360 4 күн бұрын
Exactly, it's as if one were trying to compete with the calculator hahahahahahaha
@richbaird9407
@richbaird9407 4 күн бұрын
If only a PhD were about skills like programming and solving equations. Literally every PhD student uses solvers for anything more complex than basic calculus anyways. The challenge of a PhD is learning how to think about things in unique ways and pushing boundaries and exploring new possibilities.
@some_one
@some_one 4 күн бұрын
No no no you got it all wrong, you get a PhD to solve standardized questions on a test!
@o1-preview
@o1-preview 4 күн бұрын
it has learning tokens now, wait another 2 models and get back to me
@VisionaryPathway
@VisionaryPathway 4 күн бұрын
@@o1-preview facts
@bartekb4191
@bartekb4191 4 күн бұрын
There are too many PhDs with closed minds out there for it to be true...
@pavlinggeorgiev
@pavlinggeorgiev 4 күн бұрын
@@o1-preview just another 2 models bro ... trust me
@TonyCecala
@TonyCecala 4 күн бұрын
They may replace PhDs. But never will they approach your PhD in sarcasm.
@nerlind
@nerlind 4 күн бұрын
If I have learned anything...everything is a few models away
@mananshah3248
@mananshah3248 4 күн бұрын
Try prompting it to write the office starting scene.
@soulsmith4787
@soulsmith4787 4 күн бұрын
Have you seen Neuro on Twitch? That little AI is the master of sarcasm. It's so strong that you can even tell despite the monotone tts.
@alevyts3523
@alevyts3523 4 күн бұрын
They can replace PhDs. In the sense that they can answer standard questions that a PhD can answer in theory.
@CyanRooper
@CyanRooper 4 күн бұрын
​@@soulsmith4787 you mean that AI loli Vtuber that sings songs like Bury the Light and Never Gonna Give You Up?
@DETahaX
@DETahaX Күн бұрын
"officer hardass" kills me every time with that picture 😭😭
@christopherchilton-smith6482
@christopherchilton-smith6482 3 күн бұрын
I've been having a blast with it. I used gpt4 to setup the bare bones of a mud-like text game, I've got a compass in every room showing the direction of exits, inventory, can equip and unequip items, drop items from inventory, pick them up, place monsters, really simple combat (saving the in depth stuff for later) but what I couldn't do with gpt4 or gpt4o was make a top down map that shows all the rooms and their connections in relation to each other just using unicode characters. No matter how I tried to break the problem down and describe it I just couldn't get useful code. o1 produced the code and put in a legend. I'm talking with it about branching dialogue solutions and think it may be able to help me import TWINE exports as json as a solution for doing branching dialogue. I litteraly could never have done any of this without these tools, I'm in love.
@joshroberts8944
@joshroberts8944 4 күн бұрын
This is concerning, it took the AI over 10,000 attempts with access to every relevant example on the internet during a contest to get gold lmao
@maxave7448
@maxave7448 4 күн бұрын
It basically tried everything until somwhing worked lol
@StickzDev
@StickzDev 4 күн бұрын
Like dr strange searching through every possibility to win against Thanos
@J-Kimble
@J-Kimble 4 күн бұрын
@@maxave7448 We're getting better at making software that throws sh*t on the wall and sees what sticks. Also known in the human world as a sh*tty programmer.
@genpotrait2274
@genpotrait2274 4 күн бұрын
Its not about those 10000 attempts, but how long it takes.
@fernandoacostaylara2586
@fernandoacostaylara2586 4 күн бұрын
@@genpotrait2274 Not really, its not viable to run 10000 attempts. In reality it won't know which scenario is the correct one
@evanseka4054
@evanseka4054 4 күн бұрын
"A car won't take your job, but another horse driving a car will." That hit way harder than it needed to.
@jamaludeenameen5361
@jamaludeenameen5361 4 күн бұрын
I dont understand it, please explain
@arxzhh
@arxzhh 4 күн бұрын
@@jamaludeenameen5361this new technology won’t take your job, but someone who knows how to use that technology will, not the machines itself.
@HessW
@HessW 4 күн бұрын
​@@arxzhh No, you got it wrong, it's even deeper, than you thought. The meme is about everyone who is talking like that. Horses had a job to transport humans, then a car appeared, as we all know, horses don't drive a car
@VitorCosta-n2m
@VitorCosta-n2m 4 күн бұрын
@@HessW No, wronger, it's even deeper. The car with his horsepower would bestow the horse, revealing a zero sum. Which after would divide the AI capability of coding.
@RedactedBrainwaves2
@RedactedBrainwaves2 4 күн бұрын
No worries guys. Afghanistan still has a big market for horses.
@midicine2114
@midicine2114 4 күн бұрын
Fuck it, I’m becoming a plumber. I’m also tired of these “snake game” examples. It’s just a glorified google at that point. Tons of snake examples on the web.
@dsfs17987
@dsfs17987 4 күн бұрын
and they mostly suck, which is what this "ai" is using to teach itself, garbage in - garbage out
@nuvotion-live
@nuvotion-live 4 күн бұрын
I laughed out loud at these coding demos
@univera1111
@univera1111 4 күн бұрын
Iv already given up on programming. And just on how to use already created softwares.😢😢😢
@SMGA14
@SMGA14 4 күн бұрын
Buddy, the robots will be the plumbers, no job is safe plus you're not guaranteed to be a plumber since the workforce will be saturated from all the people that lost their jobs turning into plumbers
@GeneralKenobi69420
@GeneralKenobi69420 4 күн бұрын
@@SMGA14 Nah, robots are California tech bro copium. Trade jobs are mostly safe for the next 20 years
@4RILDIGITAL
@4RILDIGITAL 3 күн бұрын
The potential of AI is indeed vast yet it falls short at times. In the end, it's a tool, at least for now.
@johnsmith1953x
@johnsmith1953x 4 күн бұрын
*How many 'r' characters are in the word "strawberry" ?* GPT-4 : TWO!! GPT-o1: "I have the answer for realsies, but it'll cost you $2,000"
@kindlin
@kindlin Күн бұрын
Strawbery obviously has 2 R's, idk what all the hubbub is about....
@codeaperture
@codeaperture 4 күн бұрын
Ah! 0 days since AI again?
@douwemusic
@douwemusic 4 күн бұрын
Spoiler alert-this will happen every time Fireship uploads about AI
@Tozu25
@Tozu25 4 күн бұрын
Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society. Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.
@mr.nixtheboarddrawer1175
@mr.nixtheboarddrawer1175 3 күн бұрын
​@@Tozu25 people don't want to work thats why
@Tozu25
@Tozu25 3 күн бұрын
@@mr.nixtheboarddrawer1175 Well, the possible future products made by AI are not gonna be handed for free to you, unless society becomes socialist, and I don’t think that’s any more good.
@ThreefieldsMedia
@ThreefieldsMedia 4 күн бұрын
Hearing a slight raspiness in Fireship's voice is a subtle reminder that it is not AI-generated yet.
@diegogarcia.57
@diegogarcia.57 4 күн бұрын
Didn't someone else close his voice and he said that he didn't minded?
@unholycrusader69
@unholycrusader69 4 күн бұрын
*Yet.*
@w花b
@w花b 4 күн бұрын
Or maybe that's a sign this video was... For the first time
@Ainigma
@Ainigma 4 күн бұрын
prompt: add raspiness, increase by 15.000%
@o1-preview
@o1-preview 4 күн бұрын
fireship cloned his own voice waaaay back when he had very few subs and used it for a couple of vids
@TheGrandChelem
@TheGrandChelem 4 күн бұрын
Is it just me who feels so sad that words are disappearing from the internet ? In this video, the word drug is censored just to please an algorithm. The other day I even saw someone who censored the word hate in «she hates being called wifey» smh
@tacitozetticci9308
@tacitozetticci9308 4 күн бұрын
You're lucky the word "wifey" survived. Gotta cherish what we have.
@hypno5690
@hypno5690 4 күн бұрын
even scarier, we are now using words like "unalive" in real life which stems directly from online advertising censorship. Corpo speak
@turolretar
@turolretar 4 күн бұрын
*t’s n*t j*st y*o b*d 😢
@livinghuman2298
@livinghuman2298 4 күн бұрын
The other day i replied to a comment with 100% innocent sentence, no reason to censor it, yet it was deleted, soon we won't be able to say anything.
@khhnator
@khhnator 4 күн бұрын
that's just how language works. internet is not being special here
@Beautiful-ModelBarbara-Hut-s1j
@Beautiful-ModelBarbara-Hut-s1j 3 күн бұрын
I almost want fireship to stop posting. This channel is scaring the shit out of me and my career. This is fucking nuts
@legendoffeiyouknow709
@legendoffeiyouknow709 2 күн бұрын
Did you even watch the last 2 minutes? Moral of the story: the "deep thinking" is basically ChatGPT prompting your prompt to create a more accurate answer. But it managed to fail creating a console based game.
@lyznav9439
@lyznav9439 2 күн бұрын
Meh
@clarityhandle
@clarityhandle 2 күн бұрын
​@@legendoffeiyouknow709 Its still running off the exact same base model though? Its just a new update to its prompt algorithm which makes it that much better. This is just a way to reduce hallucinations etc. Wait for GPT-5 to launch at the end of the year and then make your decision on whether its gonna make a difference to your job soon or not. Because this isn't a generational upgrade just a small update
@cartapo2669
@cartapo2669 2 күн бұрын
@@legendoffeiyouknow709 You have 5 years MAX
@danielrodrigues4903
@danielrodrigues4903 Күн бұрын
​@@legendoffeiyouknow709 You're talking to a bot
@Jackson_Zheng
@Jackson_Zheng 4 күн бұрын
0:25 Man, that clip was perfect lol
@cryptaveli
@cryptaveli 4 күн бұрын
They took our jerbs!
@Douchebagus
@Douchebagus 4 күн бұрын
They Turk are Durrr
@robcz3926
@robcz3926 4 күн бұрын
took yer durr!!!
@JonathanHelvey
@JonathanHelvey 4 күн бұрын
Tuk yer jerbs !!!!!
@aarushsaboo1194
@aarushsaboo1194 4 күн бұрын
Yarrrrr haarrrr
@zoeherriot
@zoeherriot 4 күн бұрын
Make no mistake, they need that to happen to pay for the billions they’ve sunk into training these models. (It won’t work though).
@veenmikki27
@veenmikki27 4 күн бұрын
I used to be hopeful that AI could help me out a little through school but if this stuff’s already doing phd level physics I might not have school to finish
@Tmssef
@Tmssef 4 күн бұрын
Atm there is no point in studying.
@ryzikx
@ryzikx 4 күн бұрын
calculators can do arithmetic better than any humans why learn math ?
@paegr
@paegr 4 күн бұрын
@@ryzikx Now the calculator can automatically do every job on Earth at 100 times the speed you can for 1/1000th of the cost, so you have no reason to be alive according to Capitalism
@MintBunHunter
@MintBunHunter 4 күн бұрын
@@ryzikx its cool
@oioio-yb9dw
@oioio-yb9dw 4 күн бұрын
​@ryzikx because then the AI realises you are stupid and it will tell you that 2 + 2 = 5 and so on, you will end up becoming it's dog.
@GSBarlev
@GSBarlev 4 күн бұрын
This is a huge leap forward in Sam Altman's ability to separate AI bros from their trust funds and crypto hodlings.
@Tozu25
@Tozu25 4 күн бұрын
Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society. Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.
@spaghettiking653
@spaghettiking653 2 күн бұрын
​@@Tozu25I'm not sure whether this will really replace doctors and stuff like that. Being a surgeon or dentist requires very fine motor control, extremely reliable expertise and knowledge, accountability, personality, etc., so as to not make a single mistake and to always navigate the patient's ill state perfectly. AIs and robots, which at this stage are far from known for their rigid foundations in any of these things, definitely have no ability to take any of these jobs. Moreover, if we really do eventually "solve" jobs, so that no one ever needs to work again, then we can rejoice at the fact that no one will be required to toil again. Things like UBI will become possible. The real doomsday scenario is if AI only succeeds in taking creative and artistic jobs, leaving humanity to do all the dead, manual labour. That is what I fear, not that doctors or actual trained professionals will be replaced.
@Tozu25
@Tozu25 2 күн бұрын
@@spaghettiking653 I was diagnosed by an AI chatbot when I got my paid sick leave. I told the AI my symptoms, and got questions and then a real doctor signed the digital document and left. So it's already happening. Similar to anything, the AI does the task and then someone checks the result. But it's good that you are critical about AI, and looking both ways. You are the first one out of anyone, and I've spoken to like 15 people. That tells about intelligence, in you.
@danielrodrigues4903
@danielrodrigues4903 Күн бұрын
​@@Tozu25 No, mass unemployment = new economic system and a break from the relentless capitalism dystopia we're experiencing. In big cities like London, regular new graduates can't even afford to buy houses on good salaries. The system is bullshit and needs to be torn down.
@MacCrunch
@MacCrunch 4 күн бұрын
The improvements are impressive, but there's still a lot to uncover about the true impact and capabilities of these models.
@Tozu25
@Tozu25 4 күн бұрын
Many people are forgetting for some reason is that AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society. Why should anyone be excited? We are witnessing the start of something really bad.
@RokeJulianLockhart.s13ouq
@RokeJulianLockhart.s13ouq 3 күн бұрын
​@@Tozu25 I disagree.
@Tozu25
@Tozu25 3 күн бұрын
@@RokeJulianLockhart.s13ouq Well, if an AI someday gets created which is equally as smart and conscious as a human, if not more, of course they can replace those jobs I mentioned as well. Edit: Before you mention, I know there is no such thing yet as a conscious AI and hopefully never will be. The speed of change in society would be so quick that it would mean hard times worldwide.
@RokeJulianLockhart.s13ouq
@RokeJulianLockhart.s13ouq 2 күн бұрын
@@Tozu25 LLMs are search engines, like Google is. They're nothing more than correlators. They're not a form of intelligence, as their confident incorrectness when they get stuck in recursive loops demonstrates.
@mrX666-s9p
@mrX666-s9p 2 күн бұрын
@@Tozu25 It is used as a tool stop being dumb you need human interaction even in programming it's not like I would give full access to an AI model to my business.
@yo-no9879
@yo-no9879 4 күн бұрын
1:19 good to see o1 is struggling big time with chemistry, gonna make a lot of chemists happy.
@lanceb9065
@lanceb9065 2 күн бұрын
I’ll be the 25th Chemist to give that a thumbs up 👍
@DEUTSCHWULF
@DEUTSCHWULF 4 күн бұрын
By the time I finish writing this comment, this model will already be outdated.
@harmez7
@harmez7 4 күн бұрын
what if I dont finish reading your comment?
@shivarajchangale47
@shivarajchangale47 4 күн бұрын
oh my god 😅😅
@wayne8797
@wayne8797 4 күн бұрын
Very true. All these ai models look amazing but once you have used it for anything besides asking it rudimentary stuff then it falls apart very quickly.
@michaelnurse9089
@michaelnurse9089 4 күн бұрын
But each version pushes further up against the rudimentary limit. The first cars randomly exploded and had to have horses travelling behind to carry extra fuel.
@Pfennigfuchs-z7v
@Pfennigfuchs-z7v 4 күн бұрын
@@michaelnurse9089You can’t equate past advances in some field with advances in a completely other one. Quite a few parameters are different. You can however try to formulate rules for technological advancements in general. Processes like these tend to follow a logistical curve and the question is at what point of the curve are we right now. I would argue we’re about to hit the plateau.
@Simonstoster
@Simonstoster 4 күн бұрын
​@@Pfennigfuchs-z7vAlso its just a confirmation bias. For every technological innovation there is a problem unsolved since decades
@Tozu25
@Tozu25 4 күн бұрын
@@michaelnurse9089 Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society. Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.
@YaamFel
@YaamFel 3 күн бұрын
​@@Tozu25You have a fundamental misunderstanding of how LLMs work if you think they could ever replace engineers and doctors.
@mirrorsreflectyou
@mirrorsreflectyou 4 күн бұрын
But can this center a div?
@theterribleanimator1793
@theterribleanimator1793 4 күн бұрын
not yet. It can plagiarize the code for a snake game though.
@friendlyfox2189
@friendlyfox2189 4 күн бұрын
😂
@livinghuman2298
@livinghuman2298 4 күн бұрын
Cursor can, i think?
@CyanRooper
@CyanRooper 4 күн бұрын
But can it do this? *bends chair backwards*
@gramioerie_xi133
@gramioerie_xi133 4 күн бұрын
@@theterribleanimator1793 Why do you people always accuse it of ‘plagiarism’ like that even makes any sense
@hglbrg
@hglbrg 4 күн бұрын
OpenAI needs money, releases some reskinned GPT3.5 that asks "are you sure" secretly and send the response after that to the user to maintain hype, investor money and altmans job. Same bubble. Same hot (AI)r.
@justanotherchannelname1273
@justanotherchannelname1273 4 күн бұрын
Yeah, this was plain dissapointing. I was expecting some major architectural change with all the hype around 'Q*' but this is just another chatbot except it's trained to ask itself 'are you sure about that?' a couple of times and provide long COTs with a fancy UI to hide the complexity from users who don't know how to prompt worth a dang.
@DavidJames-lz8js
@DavidJames-lz8js 4 күн бұрын
(AI)r = Air. I see what you did there 😏
@gramioerie_xi133
@gramioerie_xi133 4 күн бұрын
@@justanotherchannelname1273How in the hell is consistently beating human experts in several abstract fields not impressive to you
@indigitalcreativity4500
@indigitalcreativity4500 4 күн бұрын
​@justano so what you expect from new AI, ?
@dvoiceotruth
@dvoiceotruth 3 күн бұрын
Altman write a for loop on chatgpt UI
@hendrx
@hendrx 4 күн бұрын
Remember guys, we nerfed o1 when the hype was over, but o2 is gonna make a killing
@otpezdal
@otpezdal 4 күн бұрын
Please, write the same statement but for o3 in the future
@notKhalid
@notKhalid 4 күн бұрын
to be clear, o1 are not actually new models themselves, they're built on top of gpt-4o models with extended inference abilities.
@sashub2593
@sashub2593 4 күн бұрын
well, what you just said is quite obvious because if we think about it, no company is going to redesign the entire algorithm again to come up with a new model.
@tzardelasuerte
@tzardelasuerte 4 күн бұрын
Correcto. Now in a few months gpt5 is coming out with all these advancements.
@David-gu8hv
@David-gu8hv 3 күн бұрын
Doesn't it use feed back now? Adding "one little change" can have profound effects...
@andrewcampbell7011
@andrewcampbell7011 Күн бұрын
“It’s basically just like GPT4 with the ability to recursively prompt itself”. Exactly. We are in the parlor tricks phase of this hype cycle.
@co3udatel
@co3udatel 4 күн бұрын
3:32 Ну за фруктовый сад лайк однозначно
@cbn1362
@cbn1362 4 күн бұрын
It amazes me every time how I think about this channel was all about angular and firebase back in the days and where it is now.
@crackwitz
@crackwitz 4 күн бұрын
That's like a startup pivoting when they discover what the customers really need
@complexity5545
@complexity5545 4 күн бұрын
Both, angular and firebase, are currently being re-obsoleted (by react, htmx, and svelte (or some combination)). Firebase has been dead about 8 years after it was born. Most wise programmers never used Firebase.
@dvoiceotruth
@dvoiceotruth 3 күн бұрын
That's what i stumbled across. A channel supposed to be firebase documentation is doing all crazy stuff in th name of firebase. How could that be. Thank you now I get it.
@gnarpow
@gnarpow 3 күн бұрын
No kidding! lol
@tamizharasanbe
@tamizharasanbe 4 күн бұрын
"A car won't take your job, but a horse driving a car will" .... damn!!!! deeeeeeeeeeeppp
@sentinelav
@sentinelav 4 күн бұрын
I expected something crazy, but when I saw the benchmarks, they're really not that groundbreaking. o1's reasoning token paradigm serves as a middle layer for handling complex instructions, so it's more internally organised, but that doesn't necessarily mean the underlying architecture has substantially improved. Coding, maths and science are all topics where handling information in a purely linguistic context by default is detrimental, so it naturally follows that it would be more effective to logically deconstruct problems. However, you might see similar improvements with any other LLM by manually creating an intermediary prompting stage. This is still an improvement, but remember, a significant leap ahead at this stage would mean something as groundbreaking to transformers, as transformers were to RNNs, and this is nowhere close. Make no mistake, this is part of the plateau. There will still be progress, and we should be looking to concentrate that towards building tools to aid developers, rather an attempt to replace them.
@danielrodrigues4903
@danielrodrigues4903 Күн бұрын
We should be aiming to replace everyone. Always aim high.
@kumarapillay3122
@kumarapillay3122 4 күн бұрын
before gpt used to be bad at doing even basic force questions. But to o1, i gave my fluid mechanics problem and it was able to do it and i didn't even upload the diagram pictures. Its gotten really good now
@nejiabdurrahmen
@nejiabdurrahmen 4 күн бұрын
3:23 you can really feel the frustration, amazing
@MustafaETKER
@MustafaETKER 4 күн бұрын
How can be someone so funny and so informative at the same time in just 5 minutes
@turolretar
@turolretar 4 күн бұрын
Something, not someone
@diegogarcia.57
@diegogarcia.57 4 күн бұрын
Humans are the original AI
@tzardelasuerte
@tzardelasuerte 4 күн бұрын
And so biased. No our jobs are never going away!!!! 😡😡😡😭😭😭
@MustafaETKER
@MustafaETKER 4 күн бұрын
@@turolretar wdym
@hahahano2796
@hahahano2796 4 күн бұрын
When given 10,000 chances it finds the one monkey who can write Shakespeare.
@rise9489
@rise9489 4 күн бұрын
Even a blind squirrel will eventually find a nut
@0xhenrique
@0xhenrique 3 күн бұрын
It was on the 49th percentile with just one try. Most of developers can't even get into the competition, what's your point? That's literally programming Olympics, it's not like your average web dev doing the contest. Being on the 49th percentile means that o1 was better than half of the Olympic level programmers. That's certainly something. Denying is pure cope
@aabbvcddeeffaass6216
@aabbvcddeeffaass6216 3 күн бұрын
if the monkey can finish 10000 trys in 10 minutes, I don't see a problem.
@tabiserebour5912
@tabiserebour5912 4 күн бұрын
Whenever i see your video notifications, i start laughing even before watching the video😂
@ArifBillahOnGoogle
@ArifBillahOnGoogle 4 күн бұрын
Hi Jeff, I'm writing this comment to delightfully let you know that I absolutely like the way you do the "last kick" at the end of your videos sometimes. Beautifully crafted kick! Thanks. ❤
@toadlguy
@toadlguy 3 күн бұрын
The is the best overview of o1 I have seen yet 😊😊😊
@ryzikx
@ryzikx 4 күн бұрын
didnt expect nikocado cameo on Jeff Fireship's channel!
@imsleepy620
@imsleepy620 4 күн бұрын
Fireship's definitely my favorite horse influencer
@Adambd99
@Adambd99 4 күн бұрын
most based comment ever
@SpragginsDesigns
@SpragginsDesigns 4 күн бұрын
0:23 was a legit lol moment. Oh wait, so was most of the video.
@SkegAudio
@SkegAudio 4 күн бұрын
came here to take a break from coursework, that avocado bit had laughing way too loud for a library 😂
@SpragginsDesigns
@SpragginsDesigns 4 күн бұрын
@@SkegAudio Nobody does the developer / comedy / memes / but still informative style he has. He's one of those "never miss a video" channels I have to watch on the spot.
@NeonVisual
@NeonVisual 4 күн бұрын
When we eventually get AGI it will be so expensive to run that we will only be able to turn it on for a fraction of a second to resolve all of humanity's problems. It will then take 10 years to work through all of the data created.
@sandeepnautiyal3070
@sandeepnautiyal3070 4 күн бұрын
"And O stands for ohh sh*t we are gonna d*e" is so apt and hilarious lmao
@kili20394
@kili20394 4 күн бұрын
As a coder and developer, I have no fear of "LLMs" taking my job. A lot of the stuff I code is too specific and niche for an LLM to figure out without having hella bugs.
@sajeucettefoistunevaspasme
@sajeucettefoistunevaspasme 4 күн бұрын
as a 0.1x developper I am very afraid
@djs-vids
@djs-vids 4 күн бұрын
agreed, same
@fullstackweebdev4855
@fullstackweebdev4855 4 күн бұрын
To replace me, the customer would need to know what they want and accurately describe it to an AI. I’m perfectly safe.
@djs-vids
@djs-vids 4 күн бұрын
@@fullstackweebdev4855 and then be able to debug the trashy code AI produces
@byron_00
@byron_00 4 күн бұрын
@@fullstackweebdev4855 well said. I push back on the garbage requirements I receive and help point the customer in the right direction for something more sane. A.I. will happily write a clucking fsck.
@98ahni
@98ahni 4 күн бұрын
As long as it can't solve the _"Okay, so hear me out."_ problems the client has with all the help of _"I'm sure you'll figure it out!"_ and (of course) no further details, I think my job is pretty safe.
@user-sb5vt8iy5q
@user-sb5vt8iy5q 4 күн бұрын
Ok so when will they replace HR?
@albercode9562
@albercode9562 2 күн бұрын
My only concern is that AI goes full apocalypse mode after spending 2 days with my manager
@AyushRaj-w4j
@AyushRaj-w4j 4 күн бұрын
1:40 i never expected to wake up and see a bot having higher rating than me.
@turolretar
@turolretar 4 күн бұрын
A horse walks into a bar. The bartender asks - why the long face?
@strategistaow3520
@strategistaow3520 4 күн бұрын
If ai can replace programmers, it can replace anyone
@szymoniak75
@szymoniak75 4 күн бұрын
yup!
@CyanRooper
@CyanRooper 4 күн бұрын
Spy from TF2: "It could replace you, it could replace me. It could even replace..."
@sajeucettefoistunevaspasme
@sajeucettefoistunevaspasme 4 күн бұрын
@@CyanRooper "it could even be your mother !"I haven't seen it for a while
@DoctorRainer
@DoctorRainer 4 күн бұрын
Sadly, we don't leave in fantasy world and this thing will be massively disappointing
@ineeddaname2
@ineeddaname2 4 күн бұрын
Not really. Coding has tons of sample data to train on. There's tons of obscure roles or tasks in the business world that could be replicated if the right training data was available but it isn't since it's only in some guys head
@mrkingsquid20
@mrkingsquid20 4 күн бұрын
so cooked I'm watching this during comp sci class
@n-o-i-d
@n-o-i-d 4 күн бұрын
Paying off student loans later while having no job sure sounds like a lot of fun
@hvr8463
@hvr8463 4 күн бұрын
Too late for a refund?
@notme3987
@notme3987 4 күн бұрын
Have faith brother, see this AI scare as a good thing.
@turolretar
@turolretar 4 күн бұрын
What’s cooking? Where’s mine
@NoName-cd5ft
@NoName-cd5ft 4 күн бұрын
Me too 😢. Does anyone have any suggestions about how to stay relevant.
@Erickhetfield
@Erickhetfield 4 күн бұрын
The strawberry question is imprecise, so it's the answer. There are 2 Rs in Strawberry, it's not wrong. If you ask "Count how many Rs in Strawberry" or "How many R in Strawberry IN TOTAL" it will tell you 3.
@SafwenHafsawy
@SafwenHafsawy 4 күн бұрын
every new model comes out crushes coding benchmarks then the hype dies down
@bennythetiger6052
@bennythetiger6052 4 күн бұрын
I love how, by this point, people should've already realized they shouldn't freak out when new AI DLC drops, yet it all follows the same hype trend. They keep being like "oh, but this time it's for real", but until we see a real and fair example of it actually doing all these revolutionary things, it's illogical to assume things will be any different. It's not copium, it's just a matter of proof of concept
@DeusExRequiem
@DeusExRequiem 4 күн бұрын
2:08 the reason many people are moving over to Claude is because Claude isn't censored and is more useful for things like generating erotic content and conversations that don't sound like you're talking to HR, which is all that the majority of people care about. The o1 model is going to be great for jobs, it's a little more reliable for perfect answers, but the problem remains that corporations want something that's specifically useful and not generally useful, a lot of them have internal systems and custom setups that don't generalize, and they worry about data leaks, and would prefer the ability to run all of this in-house. The majority of AI users are fine with some generalization, can't afford to run the best ones in-house, and want it uncensored. Unless Microsoft can stay ahead, people will move on the moment something almost as good comes out that isn't censored, and Microsoft will be stuck catering to corporations who have demands.
@JanVerny
@JanVerny 4 күн бұрын
You're thinking about this all wrong. Consumer software is not where the money is at. Most profitable MS divisions are all centered around business products. They obviously want to sell AI to the business first and foremost. If you thought MS expects regular consumers to buy the Copilot+ computers, you're dead wrong. They don't care if literally no one buys it. Because business will eat that shit up. And big companies will pay insane money to get as you say their own specialized AI solutions. While things like Claude, will struggle to finance anything after they run out of venture capital.
@esarmiento7
@esarmiento7 4 күн бұрын
I just asked Claude for erotic content and he treated me like a pervert
@hastyscorpion
@hastyscorpion 4 күн бұрын
You think the reason most people use Claude is for “ erotic content” ? Dude you need to go outside and talk to actual humans more
@nousquest
@nousquest 4 күн бұрын
Claude is much more censored. I can't get it to help me with the CTFs in my ethical hacking course.
@freeottis
@freeottis 4 күн бұрын
In my experience Claude censors more. I tried asking it a question about what a stolen vehicle could be used for (a screenshot from a driver’s license exam) and it said nope. Chatgpt answered it.
@Im_Ninooo
@Im_Ninooo 4 күн бұрын
4:06 ayo is that Bogdan? 😳
@_demosthenes
@_demosthenes Күн бұрын
it's funny how every time people start to realize how limited AI is (to the point that they're literally stealing copyrighted content to train them because its just so resource intensive) they announce some "magic ai model that, trust is, this time really is awesome!" and then its immediately shown to have the same exact problems just with some flavoring
@adianblabla
@adianblabla 4 күн бұрын
Chat GPT was supposed to replace us, yet here I am cursing at chat GPT for spewing hallucatinations when I press the space keyboard after I finish coding a line.
@BrandonAaskov
@BrandonAaskov 4 күн бұрын
lol officer hardass with that image 😂
@officebatman9411
@officebatman9411 4 күн бұрын
whos that?
@therealkon_
@therealkon_ 4 күн бұрын
@@officebatman9411 Officer Hardass
@ryzikx
@ryzikx 4 күн бұрын
@@officebatman9411 someone who got fired for doing certain activities when she shouldnt have been
@JM-st1le
@JM-st1le 4 күн бұрын
😂
@bigbigdog
@bigbigdog 4 күн бұрын
@@ryzikx doing certain activities to the whole goddamn police dept
@naishiuan1
@naishiuan1 4 күн бұрын
o1ways 2 steps ahead!
@genzod-i6e
@genzod-i6e 4 күн бұрын
In those competitions were they using new challenges or old ones that the AI might have gone through during training?
@flarebear5346
@flarebear5346 4 күн бұрын
They were using old ones lmao
@veloce5491
@veloce5491 4 күн бұрын
this is always my question but the answer is always hard to find. where would they even get all these completely original coding questions to test these models on?
@ZeerakImran
@ZeerakImran 2 күн бұрын
I wrote this: How many “r” are there in “strawberry”. And it got it correct. 3. First try. Don’t forget “r” is also short for “are”. And r’s could imply “r’s”. That may seem silly but the ai is used to correcting our mistakes and typos all the time. Like the sentence i just wrote before this one. Or my first one where I didn’t place a question mark. If it doesn’t know or isn’t sure, it does make up some nonsense instead of saying I don’t know. Just like interacting with kids. Doesn’t mean they’re not intelligent. Ask a 4 year old or 5 year old how many r’s there are in strawberry and see how long they take to do it in their head. They don’t have a great short term memory, just as humans don’t. Doesn’t make us any less intelligent than ai. Works both ways. If you can’t understand why it said what it said, you have to reserve making wide sweeping statements or judgements on that thing. Whatever it is. Never know why that might be needed and it comes back to show you how wrong you were.
@dineshkumarraman6671
@dineshkumarraman6671 4 күн бұрын
AI won't take your job but a man with AI will take everyone's job!
@existenceisillusion6528
@existenceisillusion6528 4 күн бұрын
The core innovation driving o1 was made public about 6 months ago. And it really works, but we still have a long way to go. I tried it on 2 challenging problems, and it almost didn't suck.
@markmacharia5187
@markmacharia5187 2 күн бұрын
where is it posted?
@nicknelson1975
@nicknelson1975 4 күн бұрын
It can build a game of Snake because there are thousands of open source examples online.
@w.mcnamara
@w.mcnamara 4 күн бұрын
This a million times over lmfao. If only the people hyping ai through the moon knew even the most basic aspects of how llms work
@iraniansuperhacker4382
@iraniansuperhacker4382 4 күн бұрын
@@w.mcnamara I just remind them of how crazy their ideas are. I remind them that they are claiming that linear algebra and statistics have literally become living beings and can now reason like humans. The hype is just silly at this point, I just ask them how its possible literal math became conscious and I never get a reply back.
@micca971
@micca971 4 күн бұрын
@@iraniansuperhacker4382 Probably the same way a few neurons sending singals back and forth can become conscoius aka we don't know. We don't know what consciousness is, what do you need for that or how it comes to exist. Maybe even math can become conscious who knows. That said I'm not saying any AI is conscious or even that it will ever reach consciousness, just that we don't know if it is possible.
@iraniansuperhacker4382
@iraniansuperhacker4382 4 күн бұрын
@@micca971 I would go as far as to say that math being processed on a silicon chip becoming conscious is physically impossible no matter how complex of a system it is. This is like saying if we write a sufficiently advanced piece of literature it will eventually be able to think or reason in some way. It just fundamentally doesnt make any sense.
@micca971
@micca971 4 күн бұрын
@@iraniansuperhacker4382 that's not same at all, a piece of literature does not compute or process anything it does not receive and manipulate energy, therefore it cannot do aynthing on its own. If however you said a lot of monkeys were writing books, then possibly the entire collective of monkeys writing books (a lot of them, trillions or quadrillions at least or maybe more) can become conscious or at least exhibit intelligent behaviour as we see with the current AI. Aka it's not just about complexity, it's about manipulating energy and data using some logic. Also keep in mind this is all very hypothetical, but you can't say it is fundamentaly wrong. We just don't know.
@nawawishkid
@nawawishkid 4 күн бұрын
3:17 I've just tried asking the o1-preview model `How many "r" in the word strawberry?`, it answered 3 "r"s correctly at first try. Then in the same chat, I switched to 4o model, it said 2. 🤷 Then switched back to o1-preview, it even apologized for the mistake in the previous answer made by 4o. Pretty smart to me. 🎉
@rumfordc
@rumfordc 4 күн бұрын
then you're not very smart
@deleted-something
@deleted-something 3 күн бұрын
Okay but at this rate the next model with need .05% of the worlds energy to solve a question
@theNovaStream
@theNovaStream 3 күн бұрын
POV Microsoft: Add this model inside the Copilot+ PC
@PatrickHoodDaniel
@PatrickHoodDaniel 4 күн бұрын
My prompt for the number of "r"s in the word "strawberry" got it right.
@Scrubzei
@Scrubzei 4 күн бұрын
Mine didn't
@PatrickHoodDaniel
@PatrickHoodDaniel 4 күн бұрын
@@Scrubzei interesting.
@purplebuckwheat
@purplebuckwheat 4 күн бұрын
Even GPT-4 legacy got that one right for me.
@rumfordc
@rumfordc 4 күн бұрын
@@PatrickHoodDaniel LLM's don't give consistent answers because 1) they're rate limited and the amount of compute spent changes the answer and 2) they have a 'temperature' parameter which is effectively just RNG when selecting from the top token candidates 3) every single character you type is a completely new input so something as simple as leaving out a question mark will potentially get a different answer
@chr0ne692
@chr0ne692 4 күн бұрын
I am pretty sure GPT4 also prompts itself somewhat at least because I am remember one time it accidentally showed me it's internal prompting. It said something like "user wants to understand blah blah..." then abruptly switched to explaining what I wanted.
@Caphalem
@Caphalem 4 күн бұрын
You are correct, this is something ChatGPT does. It basically tries to create a more sophisticated prompt out of your prompt before actually addressing it. However, what these new models essentially do is check their answer and try to sanity check themselves several times before giving you the final response.
@chr0ne692
@chr0ne692 4 күн бұрын
@@Caphalem I figured something like that. I just thought this distinction wasn't totally clear in the video, or maybe I wasn't paying enough attention. Thanks for the reply
@Tekaisuwu
@Tekaisuwu 4 күн бұрын
3:28 the Chain of Thought isn't hidden, you just have to click on it lol
@Fireship
@Fireship 4 күн бұрын
That's not the raw chain of thought, just a summary. This is what OpenAI says "After weighing multiple factors including user experience, competitive advantage, and the option to pursue the chain of thought monitoring, we have decided not to show the raw chains of thought to users"
@Tekaisuwu
@Tekaisuwu 4 күн бұрын
@@Fireship ohh ok gotcha
@waltersumofan
@waltersumofan 4 күн бұрын
all this energy to just not pay employees properly, it's crazy
@radektheplayer
@radektheplayer 12 сағат бұрын
True
@geovane19
@geovane19 2 күн бұрын
as long as LLMs can't translate the bizarre requirements from a actual client into a functioning product, we're good
@user-mchlnwekrrrwq
@user-mchlnwekrrrwq 4 күн бұрын
tbh Sam has always been physically scary
@wojtek987
@wojtek987 4 күн бұрын
What I was disappointed most about with this new "thinking" preview model is that it still has almost no awareness of anything relating to itself. Whenever I ask a question about itself, its hallucination rate is like 85%
@yousefsuliman
@yousefsuliman 4 күн бұрын
This is likely intentional. They said they are intentionally hiding the chain of thought from the users.
@Easternromanfan
@Easternromanfan 4 күн бұрын
I mean yeah it still is a LLM
@latt.qcd9221
@latt.qcd9221 4 күн бұрын
Why would it be aware of itself? It's just an LLM.
@tribeholz23
@tribeholz23 3 күн бұрын
I‘m pretty sure that chatgpt is not aware of any concept it generates as output. Pattern recognition and awareness are two distinct things.
@will_abule
@will_abule 4 күн бұрын
But can it be monetised?
@HanzDavid96
@HanzDavid96 3 күн бұрын
The statement, that a gpt that can prompt itself is not game changing will age like milk. If you can improve output quality using more calculation power, you create the basics for a successfull reinforcement learning circle!
@KyleHarrisonRedacted
@KyleHarrisonRedacted 2 күн бұрын
First thing I asked o1 was what the difference between o1 and 4o was. It ran in circles for a little bit and ultimately asked me for more information. I said “it’s you. It’s gpt models” and it took like 25 more seconds of thought and came up with the answer it had no idea what I was talking about because its training was capped to Sept 2023. I then gave it a prompt about colostomy bags, and it’s only here in this video I’m now learning about that these steps I’m getting it to take might one day cost me extra money. Well nuts to that, the subscription is already expensive enough and barely justifiable. Guess I’ll stick with 4o for most things
@azmo_
@azmo_ 4 күн бұрын
So this is just langchain but by openai?
@user-zu6ts5fb6g
@user-zu6ts5fb6g 4 күн бұрын
Yes
@Tozu25
@Tozu25 4 күн бұрын
The fact that everyone is forgetting for some reason is that AI that is taken too far will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
@christopherchilton-smith6482
@christopherchilton-smith6482 2 күн бұрын
@@Tozu25 if we can automate everything, why wouldn't we just leverage democracy to distribute surplus resources to everyone?
@Tozu25
@Tozu25 2 күн бұрын
@@christopherchilton-smith6482 Because the billionaires have a lot of control over politicians and money says what to do. Especially in the USA. So yes, I doubt they are going to be willing to lose control. The corruption is also a part of it, that exists everywhere. Only one corrupt is needed for things to happen, but usually there are more and more. Also life could become kind of boring. Usually people have some long term goals in life, then they would be what, become leaderboard #1 in a game... or do a stupid TikTok challenge. I am saying that I would get bored, because we wouldn't be able to as much as we want, whenever we want, however we want. The planet can't handle a sustainable living place for us, if all would live like millionaires. All we would do, is live boring, reagular lives with no real hard objectives or goals.
@Tozu25
@Tozu25 2 күн бұрын
@@christopherchilton-smith6482 Because billionaires and the rich top 0.1% have a lot of control over politicians. Especially in the USA. It's happening all the time now, and I don't think they are willing to let go of that power in the future either. Then there is corruption, and it happens everywhere, all it needs is one person, but of course there's a lot.
@christopherchilton-smith6482
@christopherchilton-smith6482 2 күн бұрын
@@Tozu25 you can only do whatever you want whenever you if you're rich and they don't live boring lives. If we distributed the surplus resources so that "everyone lived like millionaires" then everyone would actually be free to pursue goals that are meaningful to them instead of being trapped in a boring job. I don't believe in conspiracy theories, while there is a lot of corruption our collective will via a vote still carries weight, we can and should vote to nationalize any business that holds an automated monopoly.
@Tozu25
@Tozu25 2 күн бұрын
@@christopherchilton-smith6482 Which part was a conspiracy?
@VeryUniqueRandomName
@VeryUniqueRandomName 4 күн бұрын
74% might sound like a lot for a non-technical person, but for those who know what is SLA and how hard to go from 99.9 to 99.99, 74% is not even worth looking. Though I have doubts that LLM models will ever reach 99%
@peterhorton9063
@peterhorton9063 4 күн бұрын
Right being 95 percent accurate in your compute is terrible for most things. Imagine 1/20 words you speak and interpret wrong while not even knowing they were wrong. Errors would compound all over.
@egodreas
@egodreas 4 күн бұрын
@@peterhorton9063 I'm sure it wouldn't be too bad. I suspect that most people would probably understand you just pineapple.
@Haise-san
@Haise-san 4 күн бұрын
​@@egodreasFor people yeah, but condoms for sure need it to be accurate for them to work and solve problems.
@marked75
@marked75 3 күн бұрын
based on what you said, I think this confirms that they are now at the phase where they're doing clever implementations of the LLMs and being more specific in what it should generate well. In my opinion this is a sign that the technology is maturing, and the real potentially world changing products are coming. But it may also be a sign that this technology is at it's peak, when you can't go up, you go side ways
@skufpan
@skufpan 4 күн бұрын
This model is essentially nothing more than a small training + special system prompt that mimics the thinking process, it is not new, something similar already exists for open models such as llama (Reflection) and the right clue can be easily found or compiled yourself. OpenAI has decided to warm up its audience
@beeronme7131
@beeronme7131 4 күн бұрын
No views, 15 comments. The boys are wild.
@chrisholland6366
@chrisholland6366 4 күн бұрын
I love eventual consistency
@rumfordc
@rumfordc 4 күн бұрын
views are a thousand times more frequent than comments, so youtube counts them in batches slowly over time as opposed to comments which are just counted normally as they come in. So there is often a noticeable delay in the view count when the video is first uploaded. i believe that's what the comment above mine is referring to as well.
@beeronme7131
@beeronme7131 3 күн бұрын
@@rumfordc Thanks for explaining :) I knew how this works (Tom Scott's video about this is great).
@xnlogical_1108
@xnlogical_1108 4 күн бұрын
Honestly at this point I don't trust ANY AI with ANY sort of code. 4o is literally worse than 3.5 when it comes to coding. I guess all that AI slop on the internet is starting to be used on training models.
@alpedistas1
@alpedistas1 4 күн бұрын
Really? How did you came to the conclusion that its now worse at code than 3.5?
@user-zu6ts5fb6g
@user-zu6ts5fb6g 4 күн бұрын
Its not worse at code, maybe you are just worse at prompting?
@xnlogical_1108
@xnlogical_1108 4 күн бұрын
@@alpedistas1 It keeps ignoring my instructions, forgets variables names, writes invalid syntax, and a lot more. Which I personally did not experience with 3.5
@xnlogical_1108
@xnlogical_1108 4 күн бұрын
@@user-zu6ts5fb6g I use really simple prompts because I only use AI to quickly make changes to my code, not to write code for me most of the time. So my prompts haven't really changed even though I can't really prove it, but I can assure you they are as simple as they get. And 4o seems to do worse compared to 3.5. This is all personal experience though.
AI can't cross this line and we don't know why.
24:07
Welch Labs
Рет қаралды 515 М.
A Student Lived With A Horrible Roommate And It Ruined His Life
14:26
Секрет фокусника! #shorts
00:15
Роман Magic
Рет қаралды 107 МЛН
小丑在游泳池做什么#short #angel #clown
00:13
Super Beauty team
Рет қаралды 39 МЛН
POV: Your kids ask to play the claw machine
00:20
Hungry FAM
Рет қаралды 15 МЛН
James May finally drives the Tesla Cybertruck
14:15
James May’s Planet Gin
Рет қаралды 4,6 МЛН
Is Computer Science still worth it?
20:08
NeetCodeIO
Рет қаралды 95 М.
Mojang just made Ender Pearls absolutely OP!
8:14
Phoenix SC
Рет қаралды 1,4 МЛН
This new type of illusion is really hard to make
17:58
Steve Mould
Рет қаралды 436 М.
10 regrets of experienced programmers
8:16
Fireship
Рет қаралды 1,3 МЛН
Okay, I'm a bit scared now...
28:05
Theo - t3․gg
Рет қаралды 93 М.
How Optimizations made Mario 64 SLOWER
20:41
Kaze Emanuar
Рет қаралды 291 М.
We Put 7 Uber Drivers in One Room. What We Found Will Shock You.
12:03
More Perfect Union
Рет қаралды 2,8 МЛН
I Built The First LAMINAR FLOW ROCKET ENGINE
15:51
Integza
Рет қаралды 2 МЛН
My game is 262,000 times faster than Minecraft. I'll show you how.
12:20
IGoByLotsOfNames
Рет қаралды 919 М.
Секрет фокусника! #shorts
00:15
Роман Magic
Рет қаралды 107 МЛН