Quick recap on the state of language model reasoning

How AI Reasons | From AlphaGo to ChatGPT

The State of Reasoning - from Nathan Lambert, Interconnects/AI2 [LS Live @ NeurIPS 2024]

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

IL'HAN - Qalqam | Official Music Video

Мен атып көрмегенмін ! | Qalam | 5 серия

UFC 310 : Рахмонов VS Мачадо Гэрри

Quick recap on the state of language model reasoning

Рет қаралды 3,205

Interconnects AI

Interconnects AI

Күн бұрын

Пікірлер: 5

@TheFireHacker 17 күн бұрын

12:12 "This something we did in our project" are you talking about allenai open instruct cot model which is on Hugginface?

@interconnects 17 күн бұрын

Tulu 3 is trained with RLVR, but its a general chat model not a reasoning focused model.

@noa-h5b 6 күн бұрын

Hi Nathan, thanks for the videos you are creating! I wanted to ask you for your opinion. I want to agin (I did a phd in information retrieval in 2014 and since then I stopped doing research) do research in LLM and I am quite lost with all the new papers, research topics .... so, I decide to get into RLHF, and I don t know what are worth exploring and research questions to tackle, if you have any recommendation on that Thanks for your help

@interconnects 6 күн бұрын

I have an upcoming post on the blog about character training and other things I'm interested in. Generally, you have to commit to one thing and keep at it through the noise.

@bob-007 18 күн бұрын

First! :)

How AI Reasons | From AlphaGo to ChatGPT

17:24

How AI Reasons | From AlphaGo to ChatGPT

Art of the Problem

Рет қаралды 76 М.

The State of Reasoning - from Nathan Lambert, Interconnects/AI2 [LS Live @ NeurIPS 2024]

16:22

The State of Reasoning - from Nathan Lambert, Interconnects/AI2 [LS Live @ NeurIPS 2024]

Latent Space

Рет қаралды 2,3 М.

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

00:42

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

Daniel LaBelle

Рет қаралды 150 МЛН

IL'HAN - Qalqam | Official Music Video

03:17

IL'HAN - Qalqam | Official Music Video

Ilhan Ihsanov

Рет қаралды 700 М.

Мен атып көрмегенмін ! | Qalam | 5 серия

25:41

Мен атып көрмегенмін ! | Qalam | 5 серия

kak budto

Рет қаралды 1,2 МЛН

UFC 310 : Рахмонов VS Мачадо Гэрри

05:00

UFC 310 : Рахмонов VS Мачадо Гэрри

Setanta Sports UFC

Рет қаралды 1,2 МЛН

SOLVED: Perfect Reasoning for every AI AGENT (ReasonAgain)

24:55

SOLVED: Perfect Reasoning for every AI AGENT (ReasonAgain)

Discover AI

Рет қаралды 9 М.

[Webinar] How to Build a Modern Agentic System

1:00:55

[Webinar] How to Build a Modern Agentic System

Arthur

Рет қаралды 11 М.

What P vs NP is actually about

17:58

What P vs NP is actually about

Polylog

Рет қаралды 146 М.

Jason Wei: Scaling Paradigms for Large Language Models

40:10

Jason Wei: Scaling Paradigms for Large Language Models

Mayur Naik

Рет қаралды 5 М.

Berry's Paradox - An Algorithm For Truth

18:34

Berry's Paradox - An Algorithm For Truth

Up and Atom

Рет қаралды 476 М.

Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?

10:23

Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?

AI Papers Academy

Рет қаралды 25 М.

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

57:45

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Grant Sanderson

Рет қаралды 324 М.

A Systems-Minded Approach to Creating a Music Player Application by Andrew Kelley

26:13

A Systems-Minded Approach to Creating a Music Player Application by Andrew Kelley

TigerBeetle

Рет қаралды 45 М.

NEW: Better In-Context Learning ICL, Improved RAG (Harvard)

26:43

NEW: Better In-Context Learning ICL, Improved RAG (Harvard)

Discover AI

Рет қаралды 7 М.

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

00:42

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

Daniel LaBelle

Рет қаралды 150 МЛН