Deepseek AI's R1 vs.

  Рет қаралды 36

Kautilya - Your Success Partner

Kautilya - Your Success Partner

Күн бұрын

How does R1 stack up against O1? Not just in terms of evals, but in everyday usage?
With all the hype around R1, I couldn’t resist testing it out. And to make things fair, I brought in Anthropic's Claude to act as a judge for the outputs (yes, LLM as a judge-how meta is that?).
💡 Check out the video recording where I gave both R1 and O1 the same prompt and had Claude evaluate the results.
Initial Impressions:
✅ R1’s unique feature of showcasing its thought process is super interesting and adds a new layer of transparency to how it works.
✅ O1 delivers slightly better outputs in terms of detail and clarity, but…
✅ R1 at its price point is an incredible value-no complaints here. (For Consumers its free to use @ chat.deepseek....)
The Prompt I Used:
"You are tasked to enter the Indian OTT market. How would you go about it? What aspects would you consider? Come up with a detailed strategy and steps for execution."
Both LLMs approached the task differently, which made the comparison even more exciting.
✨ What are your thoughts? Have you tested R1 or O1 yet? What’s your go-to LLM and why? Let’s discuss in the comments!

Пікірлер
China announces retaliatory tariffs on US goods
5:29
Al Jazeera English
Рет қаралды 228 М.
What if all the world's biggest problems have the same solution?
24:52
JISOO - ‘꽃(FLOWER)’ M/V
3:05
BLACKPINK
Рет қаралды 137 МЛН
The 8 AI Skills That Will Separate Winners From Losers in 2025
19:32
Nvidia Just Revealed The Future Of AI Agents In 2025..
12:48
TheAIGRID
Рет қаралды 140 М.
Nvidia CEO Huang New Chips, AI, Musk, Meeting Trump
15:28
Bloomberg Technology
Рет қаралды 224 М.
How to make Muilt-Agent Apps with smolagents
22:08
Sam Witteveen
Рет қаралды 14 М.
AutoGen Tutorial 🚀 Create Custom AI Agents EASILY (Incredible)
20:10
JISOO - ‘꽃(FLOWER)’ M/V
3:05
BLACKPINK
Рет қаралды 137 МЛН