OpenAI's Noam Brown, Ilge Akkaya and Hunter Lightman on o1 and Teaching LLMs to Reason Better

34,021 views

Sequoia Capital

A day ago

Comments: 78
@user-pt1kj5uw3b 29 days ago
I hate to thank our corporate VC overlords, but these interviews are pretty cool. I think they will be historically significant in a few years.
@sup3a 25 days ago
100%
@rollingrock3480 11 days ago
They will be legally significant as an example of how big tech companies have defeated the spirit of the law time and time again, to the overall detriment of society. (Remember Facebook giving 5 points for an angry reaction and 1 point for a like when recommending posts for your FB feed, then telling Congress they want to "bring us all together"?)
@andrewwalker8985 26 days ago
You can see the value of open source in this interview. We don't get smart people sharing their thoughts and excitement openly; we get smart people who are excited and would love to share, armed with pre-approved sentences that they are allowed to say.
@sup3a 25 days ago
Very good podcast, thank you. No extra hype, just very matter-of-fact. Just what I need in the middle of all the hype.
@NandoPr1m3 29 days ago
I like that we are getting to see the real people behind the curtain at OpenAI. My big takeaway is that they (A) have other ideas being researched and (B) aren't afraid to try new paradigms, which is basically what led to the o1 models.
@rickandelon9374 29 days ago
Btw the capital of Bhutan is Thimphu. It's a mostly hilly country and the world's only carbon-negative country.
@marwin4348 29 days ago
Sucks for them, they still did not achieve industrialisation?
@rickandelon9374 29 days ago
@@marwin4348 the country is expensive as hell and poor in terms of self-dependence. Mostly they import their goods from India, which bullies them constantly with various political pressure.
@ominousplatypus380 26 days ago
"Mostly hilly" might be the understatement of the century. The entirety of the country is enveloped by the Himalayas and it's arguably the most mountainous country that exists.
@andrewwalker8985 26 days ago
@@marwin4348 that seems uncalled for
@-rate6326 13 days ago
@@rickandelon9374 India doesn't actually bully Bhutan. India is the security guarantor for Bhutan against China. This year India allotted 267 million USD for Bhutan. India doesn't need to bully Bhutan; Bhutan just accepts whatever India says. The Bhutanese military is trained by India, and they train in India. The real bully is China: China is responsible for salami slicing around Bhutan's borders. India and Bhutan have a ten-article perpetual treaty, signed right after independence. Under this treaty India can't interfere in Bhutan's internal matters, while Bhutan's external matters are guided by India. Recently China said that if Bhutan permanently agrees to give a certain part of Bhutan to China, they will return the part China has taken from Bhutan. Bhutan was agreeing to this, but India said it's a Chinese trap. Why should Bhutan permanently give up territories that belong to Bhutan? What you are saying is probably a Chinese influence operation. China is the big bully in Asia.
@emmanuelgoldstein3682 1 month ago
That's the biggest bowl of strawberries I've ever seen
@solomonmatthews7921 1 month ago
Large beyond reason.
@tomenglish9340 29 days ago
@@solomonmatthews7921 Perhaps not strawberries all the way down.
@avraham4497 4 days ago
@@tomenglish9340 what model are you?
@tomenglish9340 4 days ago
@@avraham4497 Model T
@senju2024 1 month ago
They have "Strawberries" on the table while talking about o1. NICE!~
@xiaoxiandong7382 25 days ago
It's funny the researchers kept looking at the paper in front of them. Does it say what they can say vs not?
@spinvalve 7 days ago
Is it just me, or is the male host from Sequoia a doppelganger of 3Blue1Brown? Both his voice and appearance are stupendously uncanny.
@thatthotho 1 month ago
How many R's are in the bowl?
@Crux69 29 days ago
Technically, none :D
@tomenglish9340 29 days ago
There are 3 R's in STRAWBERRIES, as in STRAWBERRY.
@adityakrishnaakula746 29 days ago
Quite cheeky 😂 that they have a bowl of strawberries there
@constantinelinardakis8394 21 days ago
21:30 on STEM in hard reasoning, that's why o1 is so good
@JoshuaGottlieb-oz4er 28 days ago
Great content; thank you
@uw10isplaya 29 days ago
24:11 is the most interesting topic in AI for me
@PaddyLamont 1 month ago
That little beep sound before the intro had me guessing whether my headphones had gone haywire.
@user-pt1kj5uw3b 29 days ago
Same. Felt like a telegraph operator interpreting Morse code for a second. They need to add a visual component.
@JumpDiffusion 29 days ago
9:15 so he basically avoided the question 😏
@vnehru1 28 days ago
Yes. Also noticed.
@MrC0MPUT3R 23 days ago
He didn't really avoid it; he just said he doesn't know. They're hoping that, as the reasoning method the model uses is tested in a diverse set of domains, the weaknesses and strengths become clear, so that at some point in the future they can actually answer that question and further refine training methods.
@whemmakatatt5311 1 month ago
Dayum, one to watch for suuure
@maxziebell4013 1 month ago
Great discussion
@user-wr4yl7tx3w 1 month ago
What's on the paper? Why is everyone staring at theirs?
@Crux69 29 days ago
PR and Legal notes from their internal ASI ;)
@tomenglish9340 29 days ago
@@Crux69 That's what it looks like to me -- no joke.
@alexiscao8749 1 month ago
The definition of reasoning: "the ability to consider more options and evaluate the correctness of the choice". Isn't that search for the optimal choice?
@tomenglish9340 29 days ago
In a recent talk (I've forgotten which), he made it clear that he was conflating search with reasoning. I wouldn't do that, but I don't think it's a sin.
@constantinelinardakis8394 21 days ago
26:22 def on AGI
@constantinelinardakis8394 24 days ago
12:00 training on tons of data
@constantinelinardakis8394 21 days ago
36:00 on just data and time
@prince-din 29 days ago
Why can't I open my SeqCap?
@redyican5341 29 days ago
It will look like this: math > programming > simulations > agents > answering hard open questions. IMO it's easier to source info from the real world than to run some simulations. Infinite IQ doesn't exist. IQ is search in solution space, and it has constraints even with the best heuristics; we can see in humans that these heuristics are maladaptive when applied to too-narrow problems. So a single model trained on general questions won't develop the insane heuristics of 170-IQ people. An MoE architecture can kinda have this high IQ in different domains. Also, I think there needs to be an ability to act/experiment to answer some harder open problems. I still think we need to master online learning, but it's likely that better training on long context can achieve it. Even better if it could adjust weights after
@redyican5341 29 days ago
I think one actually needs to have the model rerun after outputting the stop token and decide which questions it wants answered after its own reasoning chain, adjusting these knowledge weights. I kinda know it works like that in pretraining with synthetic data, but it would be cool to have it live.
@DanielleNewnham 28 days ago
Thimphu is the capital of Bhutan. You're welcome :)
@Mayeverycreaturefindhappiness 28 days ago
They never answered whether they have an ongoing experiment where they let it keep thinking.
@Drackomass 1 month ago
Fastest click ever
@mpnikhil 27 days ago
The capital of Bhutan is Thimphu. System 1 human response 😂.
@redyican5341 29 days ago
The limit is in energy. We would need energy to outcompete humanity, if it can be cheaper per watt. I think it might work because it doesn't have to be that general. Anyway, happy that rich noobs will finally invest in more energy.
@BrutalStrike2 13 days ago
18:38
@findjoseph 29 days ago
W
@superfliping 29 days ago
Now that most of your top leadership is gone, it seems like they don't want to invest in it anymore, kind of a contradiction to what we are seeing.
@constantinelinardakis8394 24 days ago
17:38 left off
@OBGynKenobi 28 days ago
It's not thinking, it's calculating. No one thinks that Mathematica or Wolfram Alpha is thinking.
@MrC0MPUT3R 23 days ago
You're not thinking. Your brain is just undergoing some electrochemical reactions.
@jamdec123 29 days ago
Interesting enough conversation. However, it may be beneficial to have people possessing models first before designing models, clearly. There's a lack of life experience somewhere. Anywho, I'll let these guys get back to facilitating AI on how they can best lick their own parts. PeaceOUT
@attilaszasz-mb2sj 5 days ago
someone please tell these people that o1 is not good at all :D
@videochampion 18 days ago
Dude is smart but such a dorky speaker
@videochampion 18 days ago
Take your time to process your output, like o1... Erhm uhm erhm lol
@user-wr4yl7tx3w 1 month ago
Does the seating make sense? I think it would have been better to have the girls seated in the center given their height.
@thenoblerot 29 days ago
I am so unreasonably annoyed by her awful framing! And make sure the guests' mics aren't blocking their faces!? That said... I was mostly listening, as I'm sure most are. Great talk regardless!
@tomenglish9340 29 days ago
Randomized to ensure that there was no gender bias.
@wwkk4964 29 days ago
Noam has been saying to roomfuls of people "you can't know the capital of Bhutan", which is a very silly (and offensive) example. Most people from China and the Indian subcontinent (40% of the world population) have known it's Thimphu since elementary school.
@wwkk4964 29 days ago
I'm just saying this because it detracts from his other well-thought-out points, but his example is so poor it will make him lose credibility with the wrong audience, who can't judge the rest of his claim.
@DavidToddSports 29 days ago
That is not what he is saying. He is saying if you don't know the answer when the question is asked, there is no amount of time which is going to allow you to "think" the answer.
@wwkk4964 29 days ago
@@DavidToddSports it's not a good example because it doesn't demonstrate the salience of the point he is making. It's equivalent to saying no amount of thinking will help you recognise the capital of Australia or Canada or Brazil, but is this strictly true?
@wwkk4964 29 days ago
@@DavidToddSports here's another way to think about why it's a defective example: "No amount of thinking is going to allow you to know if baseball comes from cricket or vice versa." Is this kind of statement a good example of an unreachable or computationally disconnected island of knowledge? I don't think so; it muddles things up because the question is undecidable.
@jinhongyu911 24 days ago
@@wwkk4964 I think the example he gave about the capital of Bhutan is perfectly sound given the topic of reasoning. Rather, I think your baseball example is the wrong type of example to give for the question. What Noam is saying is that, unless you've heard the name of the capital of Bhutan before, there is no amount of time which is going to allow you to reason out the answer (like David said above), given that you don't have access to the internet or books, of course. As for your example of baseball, if you have enough facts and historical records on hand, I'm sure you can come to a reasonable conclusion about which one came first. Just like the chicken-and-egg problem: if you can set the right definitions, you can surely reason out the answer. So again, the capital of Bhutan is not a 'reasoning' problem; you can't figure it out step by step, you either know it or you don't.