Рет қаралды 492
Following the Strawberry launch, we'll survey a few related papers rumored to be relevant:
STaR: Boostrapping Reasoning with Reasoning (arxiv.org/abs/...)
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking (arxiv.org/abs/...)
V-STaR: Training Verifiers for Self-Taught Reasoners (arxiv.org/abs/...)
Join the LS paper club every wednesday: lu.ma/ls