DeepSeek R1 for Structured Agents

  Рет қаралды 17,400

Sam Witteveen

Sam Witteveen

Күн бұрын

Пікірлер: 34
@djstraylight
@djstraylight Күн бұрын
Great video and I came to the same conclusion. I use a inexpensive model to do tool calls (and perplexity searches) and then feed a perfectly formated query to a reasoning model. Then use a local (or gemini flash) to format it.
@jaredcluff5105
@jaredcluff5105 Күн бұрын
This is also what I am doing. Reasoning model for intellect, non reasoning for structured output and tool calling.
@Zerobytexai
@Zerobytexai Күн бұрын
Why is nobody ever mentioning in any KZbin video at all since this thing first came out that you can now use R1 AND Websearch at the same time? That's huge and everybody is literally freaking blind about it lol
@ajwo5984
@ajwo5984 Күн бұрын
Exactly
@pabloescobar2738
@pabloescobar2738 Күн бұрын
It maybe censure, all person thing chatgtp, its god, but nothing or few person, speak claude. Or r1 deepseek, this people pay 200 dollar its normal, they think i pay 200$ i need say this is the best, and i rescue my inversiones. If see task openai its. Claude maybe existe last years and nothing say... This stage ios VS android. In other video deepseek big model nivel exact with geminis, and level up chatgtp. But i think chatgtp this is no real agi , because literal base culture US. But not word becuase if you ask for Oeste, for real history american , but this not exist. It false.
@ShinyExample
@ShinyExample Күн бұрын
@@ajwo5984 In open router chat it worked from day one lol
@amandamate9117
@amandamate9117 18 сағат бұрын
combining with gemini flash to clean the output is genius move
@IdPreferNot1
@IdPreferNot1 Күн бұрын
I've been trying to build non agentic script to do tavily research with llm processes to refine search, clean results, use results, structure and build reports, and keep getting bogged down in multiple calls and handling data models. Just realized i can just use a pydantic AI agent simply as a tool in that process to run through those without me handling the overhead and still get the structured output for chaining processes i need! If this works it will be amazing....
@thegtlab
@thegtlab Күн бұрын
good stuff! I never thought of using the reasoning model as a tool!
@ibrahimhalouane8130
@ibrahimhalouane8130 20 сағат бұрын
Great video, thanks! I'll be honest, I'm afraid of putting more effort on these kind of tricks, it might go absolute in a few months or weeks! The racing is getting intense, we'll see what o3, Opus (according to some leaks)... will bring to the table.
@orthodox_gentleman
@orthodox_gentleman Күн бұрын
I am actually building an mcp server with reasoning tools and R1 as the sole contender all for Roo Cline.
@amoledzeppelin
@amoledzeppelin Күн бұрын
Not sure about the JSON output, but you can make it output XML (and wrap the answers into CDATA sections inside the tags) by having an appropriate prompt. At least this works with the local distilled versions. But thanks for a good idea: I could try getting the output from the r1:14b and feed it into e.g. qwen2.5-coder:3b as a formatting agent.
@ahmadzaimhilmi
@ahmadzaimhilmi Күн бұрын
Precisely what I need
@jakobkristensen2390
@jakobkristensen2390 18 сағат бұрын
Great video
@60pluscrazy
@60pluscrazy Күн бұрын
Thanks 🎉🎉🎉
@orthodox_gentleman
@orthodox_gentleman Күн бұрын
Apparently with reasoning models in the system prompt you aren’t supposed to give the model the whole persona thing such as “you are a reasoning model…”
@johanneskyprianou9076
@johanneskyprianou9076 Күн бұрын
What should you put there instead, instructions?
@nathannowack6459
@nathannowack6459 Күн бұрын
yeah just more nuance, it’s implicit that it will think step by step w reasoning models. not like it would hurt to say that tho, it’s just redundant
@clray123
@clray123 7 сағат бұрын
This recommendation is specific to R1 and based on the observation that it performs worse in some benchmarks when given a system prompt than without it. Which only goes to show how stupid these models still are.
@tk0150
@tk0150 Күн бұрын
I believe and after consulting with DeepSeek that smolagents being that it outputs write its own code in Python. there is not a Json structure needed.
@mrd6869
@mrd6869 Күн бұрын
using it to "weaponize" some cybersecurity tools, for penetration testing work.
@amandamate9117
@amandamate9117 18 сағат бұрын
dude we need examples
@MJ-hl7qw
@MJ-hl7qw Күн бұрын
Please make a video about how to use distill models like DeepSeek-R1-Distill-Qwen-32B. thanks
@samwitteveenai
@samwitteveenai Күн бұрын
Anything particularly you want to use it for ?
@MJ-hl7qw
@MJ-hl7qw Күн бұрын
@@samwitteveenai for document question answering, recommender sys. and can we use this model as agentic usecase like using in langchain?
@davidkochav924
@davidkochav924 Күн бұрын
Love it
@zandanshah
@zandanshah Күн бұрын
Best Part our Politicians are still fighting over Tic Tok. 🤣🤣🤣
@clray123
@clray123 7 сағат бұрын
Actually TikTok has more relevance to the power structure of the world than "reasoning" models that still trip up on their own shoelaces.
@francycharuto
@francycharuto Күн бұрын
Cool
@AICodeDev
@AICodeDev Күн бұрын
poor thing gemini 1.5 : you never give your own opinion! hahaha
@samwitteveenai
@samwitteveenai Күн бұрын
lol it’s not allowed to have an opinion now 2.0 is here
@clray123
@clray123 7 сағат бұрын
Knowing how these models work, it will give you its opinion anyway (from time to time, unpredictably).
@JNET_Reloaded
@JNET_Reloaded Күн бұрын
ollama locally is the cheapest way to do it, dont use apis that you have to pay forunless you really have to!
DeepSeekR1 - Full Breakdown
22:49
Sam Witteveen
Рет қаралды 21 М.
Build anything with DeepSeek R1, here’s how
21:36
David Ondrej
Рет қаралды 132 М.
Try this prank with your friends 😂 @karina-kola
00:18
Andrey Grechka
Рет қаралды 9 МЛН
Сестра обхитрила!
00:17
Victoria Portfolio
Рет қаралды 958 М.
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН
How to OPTIMIZE your prompts for better Reasoning!
21:17
Sam Witteveen
Рет қаралды 14 М.
How I Code Apps SOLO That Actually Make Money (Idea + Build + Marketing Guide)
13:14
How to Build Effective AI Agents (without the hype)
24:27
Dave Ebbelaar
Рет қаралды 109 М.
DeepSeek R1 Fully Tested - Insane Performance
15:10
Matthew Berman
Рет қаралды 206 М.
This New AI Is Honestly SCARY: Real-Time Self-Learning!
9:23
AI Revolution
Рет қаралды 10 М.
Building a fully local "deep researcher" with DeepSeek-R1
14:21
PydanticAI - The NEW Agent Builder on the Block
21:45
Sam Witteveen
Рет қаралды 33 М.
AI is on Record Pace to BOOM! o3 mini, Grok 3, Operator & More!
25:15
Try this prank with your friends 😂 @karina-kola
00:18
Andrey Grechka
Рет қаралды 9 МЛН