Thoughts On A Month With Devin - From AnswerAI

  Рет қаралды 2,185

Prompt Engineering

Prompt Engineering

Күн бұрын

Пікірлер: 4
@RoryMacdonald-pfff
@RoryMacdonald-pfff 21 сағат бұрын
Thanks for the pointer to this write-up. Seems quite clear that agentic solutions in coding space aren’t there yet - but as you say, it’s likely only going to improve from here.
@jsbgmc6613
@jsbgmc6613 11 сағат бұрын
What LLM model is used in Devin for this test?
@SasskiaLudin
@SasskiaLudin 6 сағат бұрын
This is always the same issue, the utter lack of metacognitive integrated abilities, i.e. to have the agent self critically assess its own progress toward the goal and when not progressing toward it, backtrack to an alternate approach, meanwhile piling up the successfully completed subtasks (memorizing and indexing them, to have them available as stepping stones for further potential reuse), and iteratively doing so until actual first successful completion (an later trying to optimize it). What is particularly infuriating is when the system gets stuck in a never ending loop but this might also just be indicative of a too small context window relatively to the size of the code repository to simultaneously address and manage...
@engineerprompt
@engineerprompt 4 сағат бұрын
I think a potential solution would be to have two agents, one that performs a task the other verify. They needs to be completely independent.
7 Free AI Productivity Tools I Use Every Day
15:37
Futurepedia
Рет қаралды 516 М.
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН
We Attempted The Impossible 😱
00:54
Topper Guild
Рет қаралды 56 МЛН
The 8 AI Skills That Will Separate Winners From Losers in 2025
19:32
Anthropic’s Blueprint for Building Lean, Powerful AI Agents
28:25
Prompt Engineering
Рет қаралды 35 М.
Zuck says its over for software engineers
9:58
Melkey
Рет қаралды 94 М.
Gemini 2.0 Flash Thinking - Does it Pass the Misguided Attention Test?
14:23
How AI Makes My Content 10x Better (Without Sounding Generic)
12:25
Mike and Matty
Рет қаралды 1,8 М.
Qwen 2.5 Coder 32B: Is This Best Open Weight Model Better than GPT-4o?
15:36
Microsoft Just Showed Us How To Use New AI Agents...
13:56
TheAIGRID
Рет қаралды 126 М.
Google's Blueprint to Building Powerful Agents
17:31
Prompt Engineering
Рет қаралды 24 М.
Become An AI Engineer in 2025 | The 6 Step Roadmap
16:01
Greg Kamradt
Рет қаралды 19 М.
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН