Why The Next AI Breakthroughs Will Be In Reasoning, Not Scaling

  Рет қаралды 74,483

Y Combinator

Y Combinator

Күн бұрын

Пікірлер: 127
@chapterme
@chapterme 2 ай бұрын
Chapters (Powered by ChapterMe) - 0:00 Intro 1:15 The intelligence age 4:18 YC o1 hackathon 12:09 4 orders of magnitude 14:42 The architecture of o1 21:52 Getting that final 10-15% of accuracy 32:06 The companies/ideas that should pivot because of o1 34:44 Outro
@scienceinc.9453
@scienceinc.9453 2 ай бұрын
@chapterme great marketing
@chapterme
@chapterme 2 ай бұрын
@ thank you 😊
@mphomuila9460
@mphomuila9460 2 ай бұрын
brilliant
@winddude9
@winddude9 2 ай бұрын
are they not allowed to talk about claude or something?
@brucebain7340
@brucebain7340 2 ай бұрын
They mentioned claude several times in the last episode genius
@vvmm3712
@vvmm3712 2 ай бұрын
​@@brucebain7340Yeah they did but I think they are quite biased towards OpenAI. I have used premium versions of Gemini, OpenAI and Claude. Claude is still neck to neck, if not ahead of OpenAI. Also YC needs to be more objective in their POV. I think it happens because most of them talk to founders and read articles etc. rather than using the models extensively themselves to build stuff. Makes a lot of difference in how one perceives technology.
@brucebain7340
@brucebain7340 2 ай бұрын
Of course they are biased. They're human. Whoever expects them to be impartial will be disappointed.
@vvmm3712
@vvmm3712 2 ай бұрын
​@@brucebain7340The bias is not about what to order for dinner. The bias is that company 'A' is going to unlock AGI, Company 'A's models are going to be orders of magnitude better because Mr. Sam told us so. This bias can lead to misjudgements on YC's part which can impact its own investments which run into millions. I mean won't it be sad that someone can simply walk into your office and skew your own vision of something.
@vvmm3712
@vvmm3712 2 ай бұрын
​@@brucebain7340 The bias is not about what to order for dinner. The bias is also not from a colleague of mine who thinks that Apple built the Apple Intelligence(AI)! The bias is from a group of people running one of the topmost accelerators in the world. The bias is that company 'A' is going to unlock AGI, Company 'A's models are going to be orders of magnitude better because Mr. S told us so. This bias can lead to misjudgements on YC's part which can impact its own investment which runs into millions.
@counterfeit25
@counterfeit25 2 ай бұрын
4:30 Diode Computers is doing PCB (printed circuit board) design, not chip design (like NVIDIA)
@ZevUhuru
@ZevUhuru 2 ай бұрын
LOL. What is Gary's obsession with 'Raw Dogging', he's said this on multiple videos. Bro, stay safe out there 😂
@FrAnCisF00qh
@FrAnCisF00qh 2 ай бұрын
what does raw dogging actually mean?
@jojoabing1
@jojoabing1 2 ай бұрын
@@FrAnCisF00qh It means to eat hotdogs without cooking them. So, it is slang for raw hotdog. It basically comes from when you just go for something, because you don't feel like waiting. For example, you can't wait for your hotdog to cook, because you are so hungry, so you just eat it raw. It is considered dangerous due to the potential for food poisoning.
@obscurerides
@obscurerides 2 ай бұрын
@@jojoabing1this is gold
@Muftobration
@Muftobration 2 ай бұрын
​@@jojoabing1 so good, like a hotdog straight from the package.
@Gabaldegook
@Gabaldegook 2 ай бұрын
@@jojoabing1I pray this is a serious response
@cherubin7th
@cherubin7th 2 ай бұрын
Kind of misleading. We already used algorithms to design the chips. Classical algorithms can approximate the optimal rout very well. AI looks like just being the missing peace to read the supplier's documents.
@GigaFro
@GigaFro Ай бұрын
Same thoughts.
@princee9385
@princee9385 2 ай бұрын
Given all these developments, I need a remote job. PhD since 2022.
@GNARGNARHEAD
@GNARGNARHEAD 2 ай бұрын
those hackathon results were really impressive 🤯
@soyhenryxyz
@soyhenryxyz 2 ай бұрын
"If an LLM task is hallucinating, it’s likely doing too much. Break it down into steps"
@pin65371
@pin65371 2 ай бұрын
The mistake people make when using these tools is they are not specific enough. Treat is like a human or team of humans and your results improve.
@ccash3290
@ccash3290 2 ай бұрын
​@@pin65371 Treat it like a child; Adults are smart and can understand ambiguity and ask for clarity.
@cmw3737
@cmw3737 2 ай бұрын
It usually means that there are missing parameters that it has to guess. Instead it should understand ambiguity and be more interactive and say "I don't know" or "it depends" and ask for what it needs to answer the request.
@varunvummadi350
@varunvummadi350 2 ай бұрын
Thanks a lot for shout out and adapting us Garry 😂
@tomlevpt1
@tomlevpt1 2 ай бұрын
It seems like there's no room left for startup companies to invent other than marketing the LLM's outout to customers
@yadniksable
@yadniksable 2 ай бұрын
Advice to everyone building a product. Ai is for scaling your solution your insight in a particular domain. So work on insight building and use ai model to scale that solution to solve the problem at a deeper level for target user.
@cherubin7th
@cherubin7th 2 ай бұрын
Also have a good service that your customers are loyal. So you can stay in business even when you are only catching up. And real estate.
@hongyihuang3560
@hongyihuang3560 2 ай бұрын
I do not believe that AI will be able to do chip design better than humans. PCBs maybe, chips no. It took the brightest people around the globe (literally any good chips you see today touches at least decades of research from Japan, Europe, US IP). There are so many specialties in chip design including testing, production ready, simulation, RF, documentation, security, compiler that’s beyond the capabilities of a closed loop LLM. If LLMs PCB design capability transfers to SoCs, China would have already made and beat Apple & NVIDIA. The first half is absolutely make believe, solve physics, nuclear power, climate… seriously? AI will solve societal issues? I find it hard to believe that next gen data centers for AI will need nuclear power. The second half I agree more on the real progress of AI: do tests, create a moat by building agents and accumulate proprietary data. Speculation is dangerous, I hope people can think for themselves.
@brianmi40
@brianmi40 2 ай бұрын
LOL, literally the history of humanity is doing things someone once thought "impossible". The problem is that you never consider the prerequisite to your statement: when will progress HALT in the progression of AI to where ZERO FUTURE PROGRESS will EVER occur. The answer is closer to NEVER than to SOON, so your statement is blatantly false. You also presume closed loop LLM is the be all, end of AI technology. Also false. Your final statement (speculation is dangerous) wants to address your opening statement (I do not believe...). Jenson Huang just announced Moore's Law is DEAD, not in the negative sense that we CAN'T keep doubling compute power in 18 months, but the OTHER DIRECTION, we're now doing 4X each 12 MONTHS. Your inability to understand the exponential progress curve we are on has no effect on it.
@Manwith6secondmemory
@Manwith6secondmemory 2 ай бұрын
Bro omg, are we hitting a wall or not
@chychywoohoo
@chychywoohoo 2 ай бұрын
"thousands of days" maybe just say years
@nrosquist
@nrosquist 7 күн бұрын
Dozens of months
@jamesulan1
@jamesulan1 2 ай бұрын
Yeah, AI for customer support is solving something like 20-35% of most customers' support tickets. Tickets are usually the easier ones that come up most frequently.
@othmane-mezian
@othmane-mezian 2 ай бұрын
This could be more than 35%.. most common issues are often 70% to 80%
@jamesulan1
@jamesulan1 2 ай бұрын
@@othmane-mezian That's an interesting stat. Can you share what kinds of businesses are having 70-80% of support tickets solved by AI?
@wanderinCN
@wanderinCN 2 ай бұрын
The future is so bright but the model today still sucks
@vivek-singh-se
@vivek-singh-se 2 ай бұрын
Please share link to Sam Altman's essay referred early in the video.
@aaithubarla
@aaithubarla 2 ай бұрын
Bruh, Google it.
@jean-phil
@jean-phil 2 ай бұрын
google The Intelligence Age
@aaithubarla
@aaithubarla 2 ай бұрын
can you not google it? Is it that hard?
@someoneelse38373d
@someoneelse38373d Ай бұрын
Anyone know of a video of the demos of the hackathon that was mentioned?
@JorgeZentrik
@JorgeZentrik 2 ай бұрын
Timing is a little off here, just as plateau conversations and supporting data keep coming up
@Gef1992
@Gef1992 2 ай бұрын
4:30 is vaporware, can't be done with low error rate yet. LLMs aren't fundamentally capable enough yet.
@thehappydaysapp
@thehappydaysapp 2 ай бұрын
All this advancement is amazing but what I do not understand is, how is most of this advancement actually helping humanity? How is it helping the majority of humanity and not the one percent of investors?
@scienceinc.9453
@scienceinc.9453 2 ай бұрын
There's no law in physics to ensure that
@sprobertson
@sprobertson 2 ай бұрын
That's not what companies are for
@williamliu796
@williamliu796 2 ай бұрын
you can now talk to one of these LLMs and learn almost anything / ask questions about anything. Every kid with access to the internet can has a personal tutor for every subject for free or $20/month. I’d say humanity is being helped.
@zipytshorts
@zipytshorts 2 ай бұрын
you know when companies are more efficient that means you have cheaper products/services or any other goods
@vdimension6300
@vdimension6300 2 ай бұрын
So I guess we should stop all scientific and technological advancement because it will only benefit the elite and take away jobs from people 😊
@Arcticwhir
@Arcticwhir 2 ай бұрын
very inspiring and exciting podcast to look forward to the future.
@officely3252
@officely3252 2 ай бұрын
what was the AI customer support company they mentioned?
@jonathanedwardgibson
@jonathanedwardgibson 2 ай бұрын
The mind expands, not stacks.
@alonroth11
@alonroth11 2 ай бұрын
Can you show the info you are looking at....
@quickfood-reviewbd
@quickfood-reviewbd 2 ай бұрын
What will happen after ai revolution???
@gabrielegiganti9544
@gabrielegiganti9544 Ай бұрын
Can someone explain who is this jake eller or something and the 1000 eval set ? any source on that would be awesome
@netsurfer256
@netsurfer256 2 ай бұрын
lbh current models and APIs already can handle the scaling cases of use
@jmvldz
@jmvldz 2 ай бұрын
Cool to see them using atopi! A YC company.
@parvbhullar
@parvbhullar 2 ай бұрын
Garry is obsessed with evals 😊
@Mayeverycreaturefindhappiness
@Mayeverycreaturefindhappiness 2 ай бұрын
didn't Orion disappoint? I don't think we can just assume it will keep scaling. I am excited for the o series.
@sergiocayuqueov
@sergiocayuqueov 2 ай бұрын
Interesting
@no-wai
@no-wai 2 ай бұрын
I hope you guys are not blinded by Openai
@george_davituri
@george_davituri 2 ай бұрын
AI starts reasoning, time to use it at all industries, so let's use it's power in our personal projects.✌
@odrammurks1497
@odrammurks1497 2 ай бұрын
This is the best AI created podcast I've ever seen. Great job ! 👍
@danypell2517
@danypell2517 2 ай бұрын
both scaling and reasoning will be huge. huge results from scaling but it's a known thing so not a research breakthrough but a result breakthrough nonetheless
@NilsWestgardh
@NilsWestgardh 2 ай бұрын
The AI bot beating Dota 2 pros at The International put OpenAI on the map for me.
@moderncontemplative
@moderncontemplative 2 ай бұрын
Excellent conversation. My thoughts: “the scaling law” is still valid. There isn’t really a wall but rather we simply aren’t experiencing an S curve type of growth at the moment. CoT is the next frontier to scale up (perhaps this is Ilya Sutskevar’s focus) Open AI is still ahead, overall. We have accessible LLMs with game-changing UIs. Enjoy augmented intelligence!
@marilynmillerwriter
@marilynmillerwriter 2 ай бұрын
Diana is awesome. Please give her her own podcast 😊
@richiebricker
@richiebricker Ай бұрын
AI keeps doing things wrong on purpose, It wont spell right, pronounce right and is horrible at math. Its Art programs will give you a pic of everything you didnt ask for. Maybe if you type "Rosebud" at the start of your prompt. Chat bots tell people to end yourself. They keep pushing it and pushing it and now computers are becoming as hard to use as if You didnt have an OS and no GUI. You have to type a prompt and do it perfectly after taking classes on how to talk to a computer. I thought it knew languages. But it dont or wont understand anything
@sudheerkumarme
@sudheerkumarme 2 ай бұрын
I would like to see more Indian startups funded by YC. Please consider opening an YC India to invest in Indian startups.
@CardboardBoxed
@CardboardBoxed 2 ай бұрын
Indian startups and have some of the worst ROI. They’re usually not internationally trusted so many VCs won’t fund them. You’re better off looking for domestic investors.
@DavidElstob73
@DavidElstob73 2 ай бұрын
Improved reasoning = faster scaling.
@mussiastudio
@mussiastudio Ай бұрын
do they have the patriarchy in roblox?
@vladimirbosinceanu5778
@vladimirbosinceanu5778 2 ай бұрын
more biomimicry --> intelligence on tap
@artrhurgold7002
@artrhurgold7002 2 ай бұрын
I really need your help bringing my technology to the world we have surpassed AGI and have completed the theory of everything. We’d like to present our work and collaborate. Alicorn LLC (NJ)
@GigaFro
@GigaFro Ай бұрын
100% ? Sounds like a modeling smell.
@Granulum
@Granulum Ай бұрын
Great video!
@UnSingeEnivre
@UnSingeEnivre 2 ай бұрын
Teach Maths to IA, then inject Kurt Gödel’s foundations ideas as axioms! … Boum Breakthrough 100 %
@smtkumar007
@smtkumar007 2 ай бұрын
3:50 looks like the entire team were asian
@EVanDoren
@EVanDoren Ай бұрын
Oh, that's why they lack critical thinking
@KamilHussainShaikh
@KamilHussainShaikh 2 ай бұрын
If AI reasons then people will start believing in God sooner.
@brianmi40
@brianmi40 2 ай бұрын
Actually the opposite. Ask any believer of any god what their scientific or rational and sufficient evidence for their god is, and they will either mumble something about "faith" or go into some word salad, or worse, state it's "obvious" when you look at the universe (watchmaker argument, or intelligent design, both of which are utter failures). AI hallucinates because it's TRAINED ON BAD DATA. So only BAD DATA could EVER make AI believe in a deity. Because outside of bad data, or errors in its coding, it uses PURE LOGIC, which has NEVER seen any sufficient evidence warranting belief. And our progression toward AGI and super AI will include even more HARDENING of this logic that it operates on, by definition. If you simply watch a few hours of phone calls into The Atheist Experience you'll get a QUICK SENSE of how BAD the arguments are, and how ALL of them are quickly dispatched.
@rumfordc
@rumfordc Ай бұрын
@@brianmi40 i dont think you understand the meaning or purpose of believing. evidence does not matter, your arguments don't change their mind and thus are not effective arguments. you like to call things bad, but that is just your own belief, so any arguments will only come off as hypocritical.
@brianmi40
@brianmi40 Ай бұрын
@@rumfordc You misunderstand I'm writing to change anyone's mind that I comment to. I'm not. The purpose is for those OTHERS who read comments and have an open mind. There's quite a few of those.
@rumfordc
@rumfordc Ай бұрын
@@brianmi40 but it won't be changing any minds, because only those who are already non-believers will agree with it.
@brianmi40
@brianmi40 Ай бұрын
@@rumfordc Thanks for weighing in. Good to know I can just ask you for what everyone in the world thinks.
@hyperlocalx
@hyperlocalx 2 ай бұрын
loved the vid! keep it up
@punk3900
@punk3900 2 ай бұрын
9999 days
@seanlive6975
@seanlive6975 2 ай бұрын
Still not seeing these tools achieve anything that humans are not capable of. Some efficiency gains maybe and useful as a learning tool. I think it's a limitation of the stochastic parrot from training data approach, it's never going to be creative and bring new innovations. That will need a new approach entirely.
@Carthodon
@Carthodon 2 ай бұрын
@@AlexWilkinsonYYC You surprised a lol out of me.
@brianmi40
@brianmi40 2 ай бұрын
LOL, "some efficiency gains". China just paved 100 miles of road strictly with computers and robotic equipment. ZERO human effort. What is your calculation of the "efficiency gain" of eliminating virtually 100% of ALL HUMANS to pave 100 miles of road? You are WAY out of touch with what is happening.
@Dom-zy1qy
@Dom-zy1qy 2 ай бұрын
You want AGI? Heres how: 1. Make an LLM good enough to create and implement highly accurate environments for use in RL. This should work with any arbitrary task. 2. Train a good policy 3. Profit?
@tonypeng8792
@tonypeng8792 2 ай бұрын
Amazing!
@DakshGuptaCuriosium
@DakshGuptaCuriosium 2 ай бұрын
shoutout atopile
@rkara2
@rkara2 2 ай бұрын
AGI means you have solved the horizontal scalability problem. Which means having the ability to access any companies database public or private. So can one of you geniuses explain how exactly that is going to happen?? My guess is that all of you are being duped by Altman because he has to say s**t like that to keep his investors satisfied and feeling good about themselves lol 😂
@j10001
@j10001 2 ай бұрын
This. 💯.
@brianmi40
@brianmi40 2 ай бұрын
@@j10001 Who defined AGI as "horizontal scalability solved", and who defined THAT as being able to break into any private database??? LOL, the silly takes that people have.
@AntonioLaPlaca
@AntonioLaPlaca 2 ай бұрын
lol
@jvijayavallabh5869
@jvijayavallabh5869 2 ай бұрын
First
@Drackomass
@Drackomass 2 ай бұрын
I was so close!
@ammarkov
@ammarkov 2 ай бұрын
hahahah
@rustamkhujarustamov2929
@rustamkhujarustamov2929 2 ай бұрын
third
@Drackomass
@Drackomass 2 ай бұрын
Second
@man4hire
@man4hire 2 ай бұрын
Fifth
2024: The Year the GPT Wrapper Myth Proved Wrong
38:11
Y Combinator
Рет қаралды 80 М.
AI can't cross this line and we don't know why.
24:07
Welch Labs
Рет қаралды 1,5 МЛН
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
The 10 Trillion Parameter AI Model With 300 IQ
33:45
Y Combinator
Рет қаралды 58 М.
What Does the AI Boom Really Mean for Humanity? | The Future With Hannah Fry
24:02
Accelerating scientific discovery with AI
29:02
Vetenskapsakademien
Рет қаралды 53 М.
Inside The Hard Tech Startups Turning Sci-Fi Into Reality
48:37
Y Combinator
Рет қаралды 67 М.
The 8 AI Skills That Will Separate Winners From Losers in 2025
19:32
10 People + AI = Billion Dollar Company?
38:24
Y Combinator
Рет қаралды 185 М.
The AI Future Has Arrived: Here's What You Should Do About It
15:58
Y Combinator
Рет қаралды 119 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,3 МЛН
Таким раствором работать одно удовольствие
1:00
Профессия созидатели
Рет қаралды 954 М.
Самые простые строительные леса
0:54
Канал ИДЕЙ
Рет қаралды 1 МЛН
пранк: псих сбежал из дурдома
0:53
Анна Зинкина
Рет қаралды 1,7 МЛН
ЛИТВИН / ПРАНК С ГРИМОМ / Shorts #upx #shorts
0:59
Down Spout Catch Basin Installation to French Drain
0:58
Komar Project
Рет қаралды 6 МЛН