Quick recap on the state of language model reasoning

  Рет қаралды 3,205

Interconnects AI

Interconnects AI

Күн бұрын

Пікірлер: 5
@TheFireHacker
@TheFireHacker 17 күн бұрын
12:12 "This something we did in our project" are you talking about allenai open instruct cot model which is on Hugginface?
@interconnects
@interconnects 17 күн бұрын
Tulu 3 is trained with RLVR, but its a general chat model not a reasoning focused model.
@noa-h5b
@noa-h5b 6 күн бұрын
Hi Nathan, thanks for the videos you are creating! I wanted to ask you for your opinion. I want to agin (I did a phd in information retrieval in 2014 and since then I stopped doing research) do research in LLM and I am quite lost with all the new papers, research topics .... so, I decide to get into RLHF, and I don t know what are worth exploring and research questions to tackle, if you have any recommendation on that Thanks for your help
@interconnects
@interconnects 6 күн бұрын
I have an upcoming post on the blog about character training and other things I'm interested in. Generally, you have to commit to one thing and keep at it through the noise.
@bob-007
@bob-007 18 күн бұрын
First! :)
How AI Reasons | From AlphaGo to ChatGPT
17:24
Art of the Problem
Рет қаралды 76 М.
IL'HAN - Qalqam | Official Music Video
03:17
Ilhan Ihsanov
Рет қаралды 700 М.
Мен атып көрмегенмін ! | Qalam | 5 серия
25:41
UFC 310 : Рахмонов VS Мачадо Гэрри
05:00
Setanta Sports UFC
Рет қаралды 1,2 МЛН
SOLVED: Perfect Reasoning for every AI AGENT (ReasonAgain)
24:55
[Webinar] How to Build a Modern Agentic System
1:00:55
Arthur
Рет қаралды 11 М.
What P vs NP is actually about
17:58
Polylog
Рет қаралды 146 М.
Jason Wei: Scaling Paradigms for Large Language Models
40:10
Mayur Naik
Рет қаралды 5 М.
Berry's Paradox - An Algorithm For Truth
18:34
Up and Atom
Рет қаралды 476 М.
Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?
10:23
AI Papers Academy
Рет қаралды 25 М.
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
NEW: Better In-Context Learning ICL, Improved RAG (Harvard)
26:43