Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs

No video

Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs | Training Data

Рет қаралды 11,433

Sequoia Capital

Күн бұрын

Пікірлер: 33

@squamish4244 Ай бұрын

His comment on where we are in the late 1800s is on point. That was during the Second Industrial Revolution. We're in the beginning stages of the Third Industrial Revolution, which will happen a hell of a lot faster and with much larger effects than even that one did.

@GNARGNARHEAD Ай бұрын

this actually gets pretty good, really excited to see what Reinforcement Learning can do for second stage training on Language Models

@stephaniezhan2610 Ай бұрын

Awesome to hear. Thank you!

@fintech1378 Ай бұрын

Maybe 'in-context learning from human feedback' will unlock new capabilities too, langchain is working something for that?

@squamish4244 Ай бұрын

And people who don't know enough about AI wonder why the next models are taking several years and not *one* year. It's because they're working on next-level stuff, people! It takes time!

@BrianMosleyUK Ай бұрын

1:03:23 great advice, and one which resonates strongly with me. Great interview, thanks everyone. New subscriber.

@stephaniezhan2610 Ай бұрын

Excited to hear. Thank you!

@odiseezall Ай бұрын

The audience can make up its mind about the mind state of people that say "safety is reliability" and "AGI will do boring tasks for you on your computer".

@ansha2221 Ай бұрын

Excellent podcast.

@dinarwali386 Ай бұрын

Fascinating. How is accuracy measured and maintained with respect to AI agents over time?

@richoffks Ай бұрын

It’s not simple enough to answer in a comment bro 😭 they have a bunch of metrics that are basically useless because the llms keep surpassing them but then there are human evaluation metrics as well, for example, how much sense it makes to a human.

@stephaniezhan2610 Ай бұрын

I actually think this is an interesting area of opportunity. I think SWE-Bench is the best benchmark for testing real-world agentic capabilities for programming, amongst those that exist today. But we need more and better benchmarks!

@jordanmiller11 10 күн бұрын

Note that his evidence for agents being close is they haven’t run all the tests on LLM models yet. Not that they have any verifiable evidence. Not promising.

@wwkk4964 Ай бұрын

What if reasoning is an illusion and the ground truth is just specialized associative memory of what worked.

@Gnaritas42 Ай бұрын

Because it's not, type 2 thinking is real, type 1 thinking is what you're talking about, you can build a 2 from a 1 with some loops and more architecture. 1 comes first, 2 is an upgrade. 2 requires something 1 doesn't do, unbounded compute, aka the ability to think longer, because you're planning internally, internal thinking, and comparing your plans to choose the better one, and using logic and induction and deduction to work through them. This is what humans do, well documented, not an illusion, just a different mode.

@wwkk4964 Ай бұрын

@@Gnaritas42 perhaps declaring axioms as truths and reverse engineering data to present in a book filled with confirmation bias is not exclusively Kahneman's achievement. This appears to be your type 1 thinking it's type 2 (both are are illusory )

@Gnaritas42 Ай бұрын

@@wwkk4964 oh god you're one of those nutjobs, no thanks. And no, they're not illusory, you're just not sane.

@ozgurgulerx Ай бұрын

Isn't agency more about the cognitive architecture you build on top of the LLM? e.g. the recent LATS , GoT, STAR papers, Q* etc...Nothing about that here...He seems to be more interested training his agency LLM which may not be the right direction...Not much meat overall for agency unfortunately.

@ManasSharma-e4m Ай бұрын

I am in first year i am worried about my future

@ParnianMotamedi Ай бұрын

is the sequoi digital currency? it listed in pancake swap? Please answer me soon i'm in hurry

@alxfazio Ай бұрын

You've got top-tier microphones, so why use AI filters that make your podcasts sound artificial? Let your natural voices shine through!

@420_gunna Ай бұрын

pink jacket smiling like a bloodless jackal the entire time and rocking in her seat Chill we get it

@user-yo6vy9lx2g 25 күн бұрын

Super super podcast

@jbraunschweiger Ай бұрын

I don’t think reflection will exist in a year

@gotemlearning Ай бұрын

Why not?

@swapanjain892 29 күн бұрын

Troll

@jbraunschweiger 28 күн бұрын

@@gotemlearning no moat. The only reason the big players haven’t done this yet is there is not much value to most people’s writing style.

@gotemlearning 28 күн бұрын

@@jbraunschweiger writing style? what does that have to do with it? I do sort of agree about the moat -- it's hard to find one in the hottest pursuit of the 21st century -- but they might find a niche.