Scaling Test Time Compute: How o3-Style Reasoning Works (+ Open Source Implementation)

  Рет қаралды 2,476

Adam Lucek

Adam Lucek

Күн бұрын

Пікірлер
@wwkk4964
@wwkk4964 3 күн бұрын
This was amazing! Thank you for sharing! This is waiting to explode as soon as we have a single breakthrough in cost of compute. Everything, no matter how small, will be able to hac away and reason about a task.
@AdamLucek
@AdamLucek 3 күн бұрын
Intelligence too cheap to measure is the hope! Given the insane competition in the space currently, I suspect we'll see similar downward cost trends as we have with regular input/output token consumption
@wwkk4964
@wwkk4964 3 күн бұрын
@AdamLucek Yes, it seems biology is a slower scaled up version of test time compute working at the cellular and somatic level. Some experiments By Michael Levin of Tufts show their lab used Bioelectric signalling to cells in animals like frogs and planaria to have them grow or regrow completely nivel morphology just by repeatedly interfering with their "ongoing computation" about what to build. I imagine we will be able to harness biological computation in nature and set them off to solve tasks for indefinite periods of time.
@bentouss3445
@bentouss3445 11 сағат бұрын
Your videos are so full of useful information, thank you so much! Love them
@edwardtbaum2169
@edwardtbaum2169 3 күн бұрын
Love your videos man. You have a good grasp on the concepts. I've been going back through your catalogue of videos to help me brainstorm some of my own ideas.
@AdamLucek
@AdamLucek 3 күн бұрын
Thanks! Hope they help!
@tspis
@tspis 2 күн бұрын
Subscribed immediately - excellent content & packed with value!
@adomicarts
@adomicarts 3 күн бұрын
Very informative and well explained Adam
@AdamLucek
@AdamLucek 3 күн бұрын
Thanks for watching!
@MaJetiGizzle
@MaJetiGizzle 3 күн бұрын
10:23 What about the original STaR paper? That seems to be a likely candidate as well, hence the original Q-Star codename for o1.
@AdamLucek
@AdamLucek 3 күн бұрын
That’s a good one too! It’s likely a mix of a bunch of things including star methodology. I chose to highlight SCoRe since their experiments showed clear improvements over STaR, but as you’ve pointed out hard to tell exactly what without knowing!
@lavamonkeymc
@lavamonkeymc 3 күн бұрын
This is awesome great video man. Can u do a vid on more advanced training embedders for RAG or Graph Rag?
@AdamLucek
@AdamLucek 3 күн бұрын
🤔
@JJBoi8708
@JJBoi8708 2 күн бұрын
Are we able to do this locally with quants on a MacBook Pro m4 pro?
@tobywoolsey7844
@tobywoolsey7844 2 күн бұрын
Wouldn’t you think that the likelihood of the OpenAI O series of models using the “search against a verifier method” Is higher than the self refinement method? I understand why the first method “self refinement” sound most probable but your looking at “search against a verifier in a way that doesn’t consider the possibility that training would work something like this: The llm generates multiple thought steps or “actions” for a given state witch can be a previous thought step of a query. Then the verifier likely the llm itself picks the best one, and this happens iteratively until a tree like structure is formed. The llm then checks the tree for errors before picking the best path! Then the llm is rewarded if the response from that path is correct. This way you don’t require a separate verifier. Trust me I started with the Microsoft everything of thoughts paper and I can’t remember the names but similar papers have come out since then that replace the verifier with the llm itself. Also self refinement can also be part of this method if you introduce a graph like structure instead of a tree like structure
@DebiprasadGhosh
@DebiprasadGhosh 3 күн бұрын
Thanks.
@alessandrofrau4196
@alessandrofrau4196 Күн бұрын
Forest of Thoughts...
@alessandrofrau4196
@alessandrofrau4196 Күн бұрын
Or forest(s) of Thoughts?
o3: Pushing the boundaries of AGI (and of coding)
22:29
Dr Waku
Рет қаралды 33 М.
The BEST Way to Chunk Text for RAG
33:17
Adam Lucek
Рет қаралды 6 М.
Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей
00:19
It works #beatbox #tiktok
00:34
BeatboxJCOP
Рет қаралды 41 МЛН
BAYGUYSTAN | 1 СЕРИЯ | bayGUYS
36:55
bayGUYS
Рет қаралды 1,9 МЛН
My Home Server is REALLY Stupid and I Need to Fix It
23:17
Linus Tech Tips
Рет қаралды 1,4 МЛН
Inference Time Compute
45:06
Chenghao Yang
Рет қаралды 920
Association Rules and Market Basket Analysis Using Python
21:42
Data Science, Machine Learning, and Python
Рет қаралды 33
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,2 МЛН
The Dark Matter of AI [Mechanistic Interpretability]
24:09
Welch Labs
Рет қаралды 98 М.
Gemini AI is Killing Software Tutorials: Live Demo of the Changes
13:13
Training Site TV
Рет қаралды 72 М.
MIT 6.S191: Reinforcement Learning
1:00:19
Alexander Amini
Рет қаралды 66 М.
Language Model Merging - Techniques, Tools, and Implementations
35:23
Building Brain-Like Memory for AI | LLM Agent Memory Systems
43:31
Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей
00:19