DeepSeek R1 Explained: How did Chain of Thought, Reinforcement Learning & Model Distillation help?

  Рет қаралды 1,338

Ragnar Pitla (Make it Happen)

Ragnar Pitla (Make it Happen)

Күн бұрын

Пікірлер: 13
@RagnarPitla
@RagnarPitla 8 күн бұрын
Thank you for watching my video! You can find the GitHub links in the description below and right here. Wishing you a fantastic day! Don’t forget to join me on KZbin for more content. www.youtube.com/@RagnarPitla?sub_confirmation=1 Deepseek Hugging Face: huggingface.co/deepseek-ai Github: github.com/deepseek-ai/DeepSeek-R1 Deepseek V3: github.com/deepseek-ai/DeepSeek-V3
@Tamara-Jost
@Tamara-Jost 7 күн бұрын
Thank you for this „scientific“ explanation. It is really helpful! Looking forward to your video on finetuning
@RagnarPitla
@RagnarPitla 7 күн бұрын
Thanks so much, Tamar! I know it’s a bit political too, but my goal is to understand how they achieved it. Learning from their approach can help us apply similar strategies while ensuring ethical use through proper licensing or open research models.
@SalmanZaidiB96
@SalmanZaidiB96 5 күн бұрын
Ah, was searching through all the content for a Rag tech update!
@YCB-in7sf
@YCB-in7sf 7 күн бұрын
Waiting for another video on deepseek
@RagnarPitla
@RagnarPitla 6 күн бұрын
haha! i feel you. We all have our views but I know its the hot topic and even I had to learn and reaserch to Know how they did it.
@passage2enBleu
@passage2enBleu 7 күн бұрын
My comment disappeared.
@RagnarPitla
@RagnarPitla 7 күн бұрын
I did have your previous message in below!
DeepSeek R1 Explained to your grandma
8:33
AI with Alex
Рет қаралды 1,2 МЛН
Chain-of-thought explained | Aravind Srinivas and Lex Fridman
4:38
УЛИЧНЫЕ МУЗЫКАНТЫ В СОЧИ 🤘🏻
0:33
РОК ЗАВОД
Рет қаралды 7 МЛН
Жездуха 42-серия
29:26
Million Show
Рет қаралды 2,6 МЛН
Vampire SUCKS Human Energy 🧛🏻‍♂️🪫 (ft. @StevenHe )
0:34
Alan Chikin Chow
Рет қаралды 138 МЛН
Small Language Models Explained: The Future of Business Transformation
32:24
Ragnar Pitla (Make it Happen)
Рет қаралды 17 М.
How to Use Small Language Models for Industry Specific Use Cases
1:03:49
DeepSeek is a Game Changer for AI - Computerphile
19:58
Computerphile
Рет қаралды 1,3 МЛН
DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?
9:09
AI Papers Academy
Рет қаралды 68 М.
AI Is Making You An Illiterate Programmer
27:22
ThePrimeTime
Рет қаралды 299 М.
Reinforcement Learning from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 119 М.
DeepSeek facts vs hype, model distillation, and open source competition
39:17
Cat Has no Fear While Messing with Deer || ViralHog
0:35
ViralHog
Рет қаралды 19 МЛН
Few People Know This Tips 🤫
1:00
Tool_Tips
Рет қаралды 22 МЛН
Ұялмаған әнші болады💥😍ШОККК
0:57
Жаңалық әлемі
Рет қаралды 522 М.