LLM Self-Taught Reasoning - Explained!

  Рет қаралды 951

CodeEmporium

CodeEmporium

Күн бұрын

Пікірлер
@EricKaysan
@EricKaysan 17 күн бұрын
Great job!
@EobardUchihaThawne
@EobardUchihaThawne 18 күн бұрын
why do you think task spesific models (basic, non llm models) cant do arithmetic without cot dataset?
@willw4957
@willw4957 19 күн бұрын
How does the back-propagation, fine-tuning and inference work though? The rationale is a more detailed answer, this is bootstrapping the dataset with model outputs hoping theres enough context in the question answer to generate a rationale? which is probably why the rationales are still wrong.
@CodeEmporium
@CodeEmporium 19 күн бұрын
For backprop, the output answer (without the rationale) generated during the rationale generation phase is compared to the label output. From this, we can get a loss and hence backprop comes in for the network to learn during the fine tuning phase. The issue here, and with STaR is that even though the answer may be right, the rationale could be wrong
@CyberwizardProductions
@CyberwizardProductions 5 күн бұрын
it was a good video - until you got to quiz time and decided to try to click your tongue and make incredibly annoying jeopardy sounds. that cost you a like and a subscribe
LLM Agents - Explained!
14:13
CodeEmporium
Рет қаралды 1,3 М.
Coding Was HARD Until I Learned These 5 Things...
8:34
Elsa Scola
Рет қаралды 705 М.
Симбу закрыли дома?! 🔒 #симба #симбочка #арти
00:41
Симбочка Пимпочка
Рет қаралды 5 МЛН
Noodles Eating Challenge, So Magical! So Much Fun#Funnyfamily #Partygames #Funny
00:33
Farmer narrowly escapes tiger attack
00:20
CTV News
Рет қаралды 13 МЛН
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 1,3 МЛН
2 Years of C++ Programming
8:20
Zyger
Рет қаралды 9 М.
Universities are failing students. Let's talk about why.
20:45
Jared Henderson
Рет қаралды 106 М.
before you code, learn how computers work
7:05
Low Level
Рет қаралды 525 М.
RAG - Explained!
30:00
CodeEmporium
Рет қаралды 3 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 3,8 МЛН
I never understood why you can't go faster than light - until now!
16:40
FloatHeadPhysics
Рет қаралды 4,1 МЛН
How on Earth does ^.?$|^(..+?)\1+$ produce primes?
18:37
Stand-up Maths
Рет қаралды 426 М.
Симбу закрыли дома?! 🔒 #симба #симбочка #арти
00:41
Симбочка Пимпочка
Рет қаралды 5 МЛН