DeepSeek R1 for Structured Agents

Views: 17,400

1 day ago

Comments: 34

@djstraylight 1 day ago

Great video, and I came to the same conclusion. I use an inexpensive model to do tool calls (and Perplexity searches) and then feed a perfectly formatted query to a reasoning model. Then I use a local model (or Gemini Flash) to format the output.

@jaredcluff5105 1 day ago

This is also what I am doing: a reasoning model for intellect, and a non-reasoning model for structured output and tool calling.
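The three-stage split described in this thread (cheap model for tool calls, reasoning model for the thinking, cheap model again for formatting) could be sketched roughly as follows. This is only an illustration: `run_pipeline`, the `Model` type, the prompts, and the stub models are all hypothetical names made up here, and each stub stands in for whatever real API client you would actually call.

```python
import json
from typing import Callable

# Hypothetical type: any function mapping a prompt string to a completion string.
Model = Callable[[str], str]

def run_pipeline(question: str, tool_model: Model,
                 reasoning_model: Model, formatter_model: Model) -> dict:
    # Stage 1: the inexpensive model handles tool calls/search and
    # produces a clean, well-formed query.
    query = tool_model(f"Gather context and write a precise query for: {question}")
    # Stage 2: the reasoning model only has to think about that query.
    analysis = reasoning_model(query)
    # Stage 3: a small model turns the free-form analysis into JSON.
    raw = formatter_model('Return only JSON with the key "answer".\n\n' + analysis)
    return json.loads(raw)

# Stub models so the sketch runs without any API key (assumptions, not real clients).
def fake_tool_model(prompt: str) -> str:
    return "QUERY: " + prompt

def fake_reasoning_model(prompt: str) -> str:
    return "After thinking it through, the answer is 4."

def fake_formatter_model(prompt: str) -> str:
    return json.dumps({"answer": "4"})

result = run_pipeline("What is 2 + 2?", fake_tool_model,
                      fake_reasoning_model, fake_formatter_model)
print(result)
```

Keeping each stage behind a plain callable makes it easy to swap a hosted model for a local one without touching the pipeline itself.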

@Zerobytexai 1 day ago

Why has nobody mentioned in any KZbin video, ever since this thing first came out, that you can now use R1 AND web search at the same time? That's huge, and everybody is literally freaking blind about it lol

@ajwo5984 1 day ago

Exactly

@pabloescobar2738 1 day ago

Maybe it's censorship. Everyone thinks ChatGPT is god, but nobody, or only a few people, talk about Claude or DeepSeek R1. For the people who pay 200 dollars this is normal: they think, "I pay $200, so I need to say this is the best and recover my investment." Look at OpenAI's tasks: Claude may have existed for the last few years and nobody says anything... This is the iOS vs. Android stage. In another video, the big DeepSeek model was exactly on the level of Gemini and a level above ChatGPT. But I think ChatGPT is not real AGI, because it is literally based on US culture, not the world's: if you ask about the West, about real American history, it does not exist. It's false.

@ShinyExample 1 day ago

@@ajwo5984 In OpenRouter chat it worked from day one lol

@amandamate9117 18 hours ago

Combining it with Gemini Flash to clean the output is a genius move.

@IdPreferNot1 1 day ago

I've been trying to build a non-agentic script to do Tavily research with LLM processes to refine the search, clean the results, use the results, and structure and build reports, and I keep getting bogged down in multiple calls and handling data models. I just realized I can use a Pydantic AI agent simply as a tool in that process to run through those steps without me handling the overhead, and still get the structured output I need for chaining processes! If this works it will be amazing....

@thegtlab 1 day ago

good stuff! I never thought of using the reasoning model as a tool!

@ibrahimhalouane8130 20 hours ago

Great video, thanks! I'll be honest, I'm afraid of putting more effort into these kinds of tricks; they might become obsolete in a few months or weeks! The race is getting intense. We'll see what o3, Opus (according to some leaks)... will bring to the table.

@orthodox_gentleman 1 day ago

I am actually building an MCP server with reasoning tools and R1 as the sole contender, all for Roo Cline.

@amoledzeppelin 1 day ago

Not sure about JSON output, but you can make it output XML (and wrap the answers in CDATA sections inside the tags) with an appropriate prompt. At least this works with the local distilled versions. But thanks for a good idea: I could try taking the output from r1:14b and feeding it into e.g. qwen2.5-coder:3b as a formatting agent.
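For anyone wanting to try the XML-with-CDATA approach, here is a rough sketch of the parsing side using only Python's standard library. The tag names (`response`, `answer`, `code`) and the sample reply are made up for illustration; a real model's output would depend entirely on your prompt and may need extra cleanup before it parses.

```python
import xml.etree.ElementTree as ET

# Hypothetical model reply: answers wrapped in CDATA so that characters
# like < and & inside the text do not break the XML.
raw = """<response>
  <answer><![CDATA[Use x < 10 && y > 2 as the condition.]]></answer>
  <code><![CDATA[if x < 10 and y > 2:
    print("ok")]]></code>
</response>"""

def parse_reply(xml_text: str) -> dict:
    # The parser unwraps CDATA sections, so element.text holds the raw content.
    root = ET.fromstring(xml_text)
    return {child.tag: (child.text or "") for child in root}

parsed = parse_reply(raw)
print(parsed["answer"])
print(parsed["code"])
```

If the model wraps its reply in extra prose, `ET.fromstring` will raise `ParseError`, so in practice you would probably extract the outermost tag first or retry the request.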

@ahmadzaimhilmi 1 day ago

Precisely what I need

@jakobkristensen2390 18 hours ago

Great video

@60pluscrazy 1 day ago

Thanks 🎉🎉🎉

@orthodox_gentleman 1 day ago

Apparently, with reasoning models, you aren't supposed to give the model the whole persona thing in the system prompt, such as “you are a reasoning model…”

@johanneskyprianou9076 1 day ago

What should you put there instead, instructions?

@nathannowack6459 1 day ago

yeah just more nuance, it’s implicit that it will think step by step w reasoning models. not like it would hurt to say that tho, it’s just redundant

@clray123 7 hours ago

This recommendation is specific to R1 and based on the observation that it performs worse in some benchmarks when given a system prompt than without it. Which only goes to show how stupid these models still are.

@tk0150 1 day ago

I believe, after consulting with DeepSeek, that since smolagents writes its own code in Python as output, no JSON structure is needed.