Рет қаралды 1,032
This video is a step-by-step tutorial to train your own reasoning model like R1 with GRPO in google colab with Unsloth.
🔥 Get 50% Discount on any A6000 or A5000 GPU rental, use following link and coupon:
bit.ly/fahd-mirza
Coupon code: FahdMirza
🔥 Buy Me a Coffee to support the channel: ko-fi.com/fahd...
🚀 This video is sponsored by EigentBOT that lets you deploy a personalized knowledge bot across platforms like Discord, Slack, etc. bot.eigent.ai
▶ Become a Patron 🔥 - / fahdmirza
#deepseekr1 #tgi #unsloth
PLEASE FOLLOW ME:
▶ LinkedIn: / fahdmirza
▶ KZbin: / @fahdmirza
▶ Blog: www.fahdmirza.com
RELATED VIDEOS:
▶ Resource colab.research...
All rights reserved © Fahd Mirza