Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs | Training Data

11,433 views

Sequoia Capital

A day ago

Comments: 33
@squamish4244 a month ago
His comment on where we are in the late 1800s is on point. That was during the Second Industrial Revolution. We're in the beginning stages of the Third Industrial Revolution, which will happen a hell of a lot faster and with much larger effects than even that one did.
@GNARGNARHEAD a month ago
this actually gets pretty good, really excited to see what Reinforcement Learning can do for second stage training on Language Models
@stephaniezhan2610 a month ago
Awesome to hear. Thank you!
@fintech1378 a month ago
Maybe 'in-context learning from human feedback' will unlock new capabilities too. Is LangChain working on something for that?
@squamish4244 a month ago
And people who don't know enough about AI wonder why the next models are taking several years and not *one* year. It's because they're working on next-level stuff, people! It takes time!
@BrianMosleyUK a month ago
1:03:23 great advice, and one which resonates strongly with me. Great interview, thanks everyone. New subscriber.
@stephaniezhan2610 a month ago
Excited to hear. Thank you!
@odiseezall a month ago
The audience can make up its mind about the mind state of people that say "safety is reliability" and "AGI will do boring tasks for you on your computer".
@ansha2221 a month ago
Excellent podcast.
@dinarwali386 a month ago
Fascinating. How is accuracy measured and maintained with respect to AI agents over time?
@richoffks a month ago
It’s not simple enough to answer in a comment bro 😭 They have a bunch of metrics that are basically useless because the LLMs keep surpassing them, but then there are human evaluation metrics as well, for example, how much sense it makes to a human.
@stephaniezhan2610 a month ago
I actually think this is an interesting area of opportunity. I think SWE-Bench is the best benchmark for testing real-world agentic capabilities for programming, amongst those that exist today. But we need more and better benchmarks!
@jordanmiller11 10 days ago
Note that his evidence for agents being close is they haven’t run all the tests on LLM models yet. Not that they have any verifiable evidence. Not promising.
@wwkk4964 a month ago
What if reasoning is an illusion and the ground truth is just specialized associative memory of what worked?
@Gnaritas42 a month ago
Because it's not, type 2 thinking is real, type 1 thinking is what you're talking about, you can build a 2 from a 1 with some loops and more architecture. 1 comes first, 2 is an upgrade. 2 requires something 1 doesn't do, unbounded compute, aka the ability to think longer, because you're planning internally, internal thinking, and comparing your plans to choose the better one, and using logic and induction and deduction to work through them. This is what humans do, well documented, not an illusion, just a different mode.
@wwkk4964 a month ago
@@Gnaritas42 perhaps declaring axioms as truths and reverse engineering data to present in a book filled with confirmation bias is not exclusively Kahneman's achievement. This appears to be your type 1 thinking it's type 2 (both are illusory).
@Gnaritas42 a month ago
@@wwkk4964 oh god you're one of those nutjobs, no thanks. And no, they're not illusory, you're just not sane.
@ozgurgulerx a month ago
Isn't agency more about the cognitive architecture you build on top of the LLM? E.g. the recent LATS, GoT, STAR papers, Q* etc... Nothing about that here... He seems to be more interested in training his agency LLM, which may not be the right direction... Not much meat overall for agency, unfortunately.
@ManasSharma-e4m a month ago
I am in my first year and I am worried about my future.
@ParnianMotamedi a month ago
Is Sequoia a digital currency? Is it listed on PancakeSwap? Please answer me soon, I'm in a hurry.
@alxfazio a month ago
You've got top-tier microphones, so why use AI filters that make your podcasts sound artificial? Let your natural voices shine through!
@420_gunna a month ago
pink jacket smiling like a bloodless jackal the entire time and rocking in her seat Chill we get it
@user-yo6vy9lx2g 25 days ago
Super super podcast
@jbraunschweiger a month ago
I don’t think Reflection will exist in a year.
@gotemlearning a month ago
Why not?
@swapanjain892 29 days ago
Troll
@jbraunschweiger 28 days ago
@@gotemlearning no moat. The only reason the big players haven’t done this yet is there is not much value to most people’s writing style.
@gotemlearning 28 days ago
@@jbraunschweiger writing style? what does that have to do with it? I do sort of agree about the moat -- it's hard to find one in the hottest pursuit of the 21st century -- but they might find a niche.
@mattwesney a month ago
This is painful