The 4 Stacks of LLM Apps & Agents

  Рет қаралды 19,480

Sam Witteveen

Sam Witteveen

Күн бұрын

Пікірлер: 39
@donatocapitella
@donatocapitella Жыл бұрын
I love when people manage to describe an entire landscape or stack is such a direct and easy to navigate way, placing all the pieces of the puzzles together! Thanks for your videos!
@stonedizzleful
@stonedizzleful 11 ай бұрын
Great video. Was aware of all these different components of LLM apps but the way you've broken it down is excellent!
@FeliciaXiao
@FeliciaXiao 11 ай бұрын
I cannot express how much gratitude I had for this video. Super clear and informative.
@SpenceDuke
@SpenceDuke Жыл бұрын
I am thankful for your channel. You're helping me prepare to excel in my current role. I like this framework, its similar to how I sort of thought about things but not as explicit.
@Diego_UG
@Diego_UG Жыл бұрын
It is understood very well, and as you explain it, they look like modules in code, but I don't know if in practice this can be done in modules, I see in your videos all these separate features, such as the context compressor and filters, but could you show us in a demo of how to make an agent that has all these components well organized? It would be great to see how you extend the medical agent that you showed us in a previous video, which already has some of these parts, so that it has all 4 stacks, such as adding the context compressor and filters, in addition to the memory with a base of vector data like qdrant, they are just ideas. Thanks for teaching and sharing, greetings.
@samwitteveenai
@samwitteveenai Жыл бұрын
This is more a framework for thinking about the various parts and how you should approach putting something together. I have been working almost exclusively on Agents now since April and LLM apps for a few years. The putting things together is certainly changing as new frameworks like Autogen and others come out. A lot of the processes I tend to run as micro services as well. but the thinking about them is pretty stable. I am planning a whole series of videos on agents and to go through the mechanics more than just the "ohh look another new agent" videos that I see a lot out there.
@bingolio
@bingolio Жыл бұрын
@@samwitteveenai perhaps a minimal implementation showing all the separate stacks?
@mytj228
@mytj228 Жыл бұрын
Wow what a discovery I made by clicking on your video! Subscribed and liked. What a great content you have.
@jayasiriasa9080
@jayasiriasa9080 Жыл бұрын
Dear Sam, thanks a lot for sharing very informative video with a framework by putting all relevant items together. Waiting for a series on effective usage of agents. Thanks.
@kavachchandra3123
@kavachchandra3123 Жыл бұрын
This is simply amazing! Would love to see all of these models explored further in their individual videos where in short examples and applications of each model is being explored. Snippets of code would help but I see that its more of a thought process and way of thinking so analogous explanations would work as well. Looking forward to something similar.
@HaseebHeaven
@HaseebHeaven 9 ай бұрын
Here is a summary of the video in bullet list form with emojis: - 🧠 Intro: Overview of how LLMs can be used to build different types of applications and agents - 🧱 LLM Stack: LLMs organized into different "stacks" or layers that each handle different tasks - 🕵‍♂ Search/Memory/Data Stack: Handles data retrieval, question answering, search 🔍 - 🧠 Reasoning and Action Stack: Handles tasks, goal-oriented dialogues, decision making, automation 🤖 - 👨‍👩‍👧 Personalization Stack: Handles personalization, user preferences, conversations over time 🗣 - 🤝 Overall: The video discusses how LLMs can be organized and developed to handle various tasks through different "stacks" or layers of functionality.
@LaHoraMaker
@LaHoraMaker Жыл бұрын
Superb video! Regarding the reasoning stack, I would love a future video Autogen agents, testing different Open source models as coordinator of models with different capabilities like Mistral, Fuyu, SQLCoder or Pandalyst
@qchaldemer
@qchaldemer Жыл бұрын
This is a fantastic framework, thanks for sharing it. You are mentioning towards the end using LLama to get data out of a database, do you have any videos for that? I have seen the ones for using LLama 2 with RAG using vector store but not necessarily querying a database a bit similar to what OpenAI codecomposer would do
@tekrunner987
@tekrunner987 Жыл бұрын
He wasn't talking about Llama but LlamaIndex, which is a library of connectors between LLMs and various sources of data.
@collinshill4821
@collinshill4821 4 ай бұрын
Can you explain how you're using the word 'heuristics' with maybe an example?
@atchutram9894
@atchutram9894 Жыл бұрын
Thank you. What kind of problems can we solve with this? how to think about ideas for such a project?
@ai_product_manager
@ai_product_manager Жыл бұрын
What are you thoughts on using language models for data engineering. At first I thought that LLMs would be overkill for data engineering, but then I realized that I should be thinking about a small language model since cleaning data is a relatively small circle of tasks that an LLM can do. Combining that will search, I have a hunch that entire data pipelines can be replaced with a series of small language models. Thoughts?
@bingolio
@bingolio Жыл бұрын
THANK YOU. PLS share your related Google Talks etc.
Жыл бұрын
Thanks Sam for this insightful video, once again. The reasoning part is definitely the part requiring more work and it can bring significant differences in an app. Working on Kyroagent I can translate your video to a very similar structure. Also, one important layer that I think is missing here, is the Interface. How the user will interact with the app and agents is a key element in building apps. not only UI but the flow and refining how we can dynamically build interaction with a AI based app. A lot to come in this area too.
@tekrunner987
@tekrunner987 Жыл бұрын
This framework is probably not perfect, but it seems like a really good starting point to think about LLM apps systematically (which I find isn't easy to do at all when new tools, libs, methods, models and so on are coming out all the time). If I may offer a couple suggestions : * I feel like "LLM stack" is a bit of a misleading name, since LLMs may be involved in all stacks. I think it'd be better to use a name that describes the function of that stack (like the other 3), though I can't think of a suggestion just now. * For production-ready apps, we'll probably need some "meta-stacks" as well. I can think of at least security (I don't even know what that would look like, but LLMs definitely present some very specific security challenges) and observability (ensure that each element of the stack performs as required; seems especially important when you start using multi-step pipelines for RAG, agent interactions, etc.)
@joaooliveira7051
@joaooliveira7051 Жыл бұрын
Very clear an usefull! Thanks
@zd676
@zd676 Жыл бұрын
I would also point out there should be a Responsible AI stack, AI / Data security, compliance, governance, etc.
@theptrk
@theptrk Жыл бұрын
"What reasoning is the agent going to use"
@andrestorres7343
@andrestorres7343 Жыл бұрын
with reasoning he is referirng to the different techniques of promoting there are, .e.g., chain of thought
@tecnopadre
@tecnopadre Жыл бұрын
Great video. Thnks
@SonGoku-pc7jl
@SonGoku-pc7jl Жыл бұрын
i like so much this video! manual writing all sentences in fisic paper jaja xD good organitzacions, yes :)
@kwtan3814
@kwtan3814 Жыл бұрын
This is gold! Thank you Sam
@samwitteveenai
@samwitteveenai Жыл бұрын
Glad this is helpful, I wasn't sure if this kind of video would be useful or not.
@sivi3883
@sivi3883 Жыл бұрын
@@samwitteveenai This is absolutely very helpful! Please provide the videos on your google talks as well please!
@ricardocosta9336
@ricardocosta9336 Жыл бұрын
lets goooooo! good job mate
@micbab-vg2mu
@micbab-vg2mu Жыл бұрын
Great video - thank you.
@bakrianoo
@bakrianoo Жыл бұрын
Waiting for a video about how you can manage your time to produce these gems !! Special type of drugs 😂
@samwitteveenai
@samwitteveenai Жыл бұрын
lol finding the time and a quiet place to record is my biggest challenge when I am traveling (like I am currently). This one is actually taken from part of a live talk I gave early in the year so was easy to reuse it.
@bakrianoo
@bakrianoo Жыл бұрын
@@samwitteveenai I wish you all the best in your life. But preparing for the topic and making the slides are critical parts here. How can you always be ready to do this besides your main work?
@ackiamm
@ackiamm Жыл бұрын
Thanks Sir
@carterjames199
@carterjames199 Жыл бұрын
Please convert the google talks
@alx8439
@alx8439 Жыл бұрын
RP - role playing - should be considered as a kind of mastrubation and publicity banned :)
@klammer75
@klammer75 Жыл бұрын
You’re the best Sam! Thank you for this and keep bringing AI gold to us digital peasants🤩😎🦾
Zephyr 7B beta - How much does DPO really help?
12:51
Sam Witteveen
Рет қаралды 8 М.
The 5 Types of LLM Apps
15:30
Sam Witteveen
Рет қаралды 30 М.
Une nouvelle voiture pour Noël 🥹
00:28
Nicocapone
Рет қаралды 9 МЛН
UFC 310 : Рахмонов VS Мачадо Гэрри
05:00
Setanta Sports UFC
Рет қаралды 1,2 МЛН
Understanding ReACT with LangChain
21:10
Sam Witteveen
Рет қаралды 52 М.
Claude MCP + GitHub = The Future of AI Agents 🚀
18:19
Florenz Erstling
Рет қаралды 2,8 М.
Anthropic's New Agent Protocol!
15:35
Sam Witteveen
Рет қаралды 42 М.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 516 М.
Multi-Agent AI EXPLAINED: How Magentic-One Works
16:39
Sam Witteveen
Рет қаралды 18 М.
AutoGen Advanced Tutorial - Build Incredible AI AGENT Teams
38:08
Matthew Berman
Рет қаралды 116 М.
LlamaOCR - Building your Own Private OCR System
17:06
Sam Witteveen
Рет қаралды 39 М.
RAG But Better: Rerankers with Cohere AI
23:43
James Briggs
Рет қаралды 64 М.