Reliable, fully local RAG agents with LLaMA3.2-3b

  Рет қаралды 61,576

LangChain

LangChain

Күн бұрын

Пікірлер: 45
@homeandr1
@homeandr1 Ай бұрын
Great explanation, would be great to do one more tutorial using multimodal local RAG, considering the different chunks like tables, texts, and images, where you can use unstructured, Chroma, and MultiVectorRetriever completely locally.
@ytaccount9859
@ytaccount9859 Ай бұрын
Awesome stuff. Langgraph is a nice framework. Stoked to build with it, working through the course now!
@leonvanzyl
@leonvanzyl Ай бұрын
The tutorial was "fully local" up until the moment you introduced Tavily 😜😉. Excellent tutorial Lance 👍
@sergeisotnik
@sergeisotnik Ай бұрын
Any internet search, by definition, is no longer local. However, embeddings here are used from a third-party service (where only the first 1M tokens are free).
@starbuck1002
@starbuck1002 Ай бұрын
@@sergeisotnik Hes using nomic-embed-text embedding model locally, so there is no token cap at all.
@sergeisotnik
@sergeisotnik Ай бұрын
@@starbuck1002 It looks like you're right. I saw that `from langchain_nomic.embeddings import NomicEmbeddings` is used, which usually means an API call. But in this case, the initialization is done with the parameter `inference_mode="local"`. I didn’t check the documentation, but it seems that in this case, the model is downloaded from HuggingFace and used for local inference. So, you’re right, and I was wrong.
@ravivarman7291
@ravivarman7291 Ай бұрын
Amazing session and content explained very nicely in just 30 mins; Thanks so much
@adriangpuiu
@adriangpuiu Ай бұрын
@lance, please add langgraph documentation to the chat. the community will appreciate that. Let me know what you think
@joxxen
@joxxen Ай бұрын
You are amazing, like always. Thank you for sharing
@becavas
@becavas Ай бұрын
Why did you use lama3.2:3b-instruct-fp16 instead of lama3.2:3b?
@LandryYvesJoelSebeogo
@LandryYvesJoelSebeogo 23 сағат бұрын
may GOD bless you Bro
@henson2k
@henson2k 28 күн бұрын
You make LLM to do all hard work for candidates filtering
@sunderrajan6172
@sunderrajan6172 Ай бұрын
Beautifully done; thanks
@SavvasMohito
@SavvasMohito Ай бұрын
That's a great tutorial that shows the power of LangGraph. It's impressive you can now do this locally with decent results. Thank you!
@davesabra4320
@davesabra4320 27 күн бұрын
Thanks it is indeed very cool. Last time you used 32Gb , do you think this will run with 16Gb? memory.
@Togowalla
@Togowalla Ай бұрын
Great video. What tool did you use to illustrate the nodes and edges in your notebook?
@user-wr4yl7tx3w
@user-wr4yl7tx3w Ай бұрын
Can you consider doing an example of contextual retrieval that Anthropic recently introduced.
@marcogarciavanbijsterveld6178
@marcogarciavanbijsterveld6178 23 күн бұрын
I'm a med student interested in experimenting with the following: I'd like to have several PDFs (entire medical books) from which I can ask a question and receive a factually accurate, contextually appropriate answer-thereby avoiding online searches. I understand this could potentially work using your method (omitting web searches), but am I correct in thinking this would require a resource-intensive, repeated search process? For example, if I ask a question about heart failure, the model would need to sift through each book and chapter until it finds the relevant content. This would likely be time-consuming initially. However, if I then ask a different question, say on treating systemic infections, the model would go through the entire set of books and chapters again, rather than narrowing down based on previous findings. Is there a way for the system to 'learn' where to locate information after several searches? Ideally, after numerous queries, it would be able to access the most relevant information efficiently without needing to reprocess the entire dataset each time-while maintaining factual accuracy and avoiding hallucinations.
@JesterOnCrack
@JesterOnCrack 2 күн бұрын
I'll take a minute to try and asnwer your question to the best of my ability. Basically, what you are describing are ideas that seem sound for your specific application, but are not useful everywhere. Whenever you restrict search results, there is a chance you're not finding the 1 correct answer you needed. Speaking from experience, even a tiny chance of not finding what you need is enough to deter many customers. Of course, your system would have a tradeoff in efficiency - completing queries quicker. Bottom line is, there are ways to achieve this with clever data- and AI-engineering. I don't think that there is a single straightforward fix to your problem though.
@VictorDykstra
@VictorDykstra Ай бұрын
Very well explained.😊
@thepeoplesailab
@thepeoplesailab Ай бұрын
Very informative ❤❤
@AlexEllis
@AlexEllis Ай бұрын
Thanks for the video and sample putting all these parts together. What did you use to draw the diagram at the beginning of the video? Was it generated by a DSL/config?
@blakenator123
@blakenator123 Ай бұрын
looks like excalidraw
@ericlees5534
@ericlees5534 24 күн бұрын
why does he make it so easy..
@fernandobarros9834
@fernandobarros9834 Ай бұрын
Great tutorial! Is it necessary to add a prompt format?
@skaternationable
@skaternationable Ай бұрын
Using PromptTemplate/ChatPromptTemplate works as well. It seems that the .format here is equivalent to the `input_variables` param within the former 2 classes
@fernandobarros9834
@fernandobarros9834 Ай бұрын
@@skaternationable Thanks!
@andresmauriciogomezr3
@andresmauriciogomezr3 Ай бұрын
thank you
@beowes
@beowes Ай бұрын
Question: You have operator.add on the loopstep, but tnen increment the loopstep in the state too… am i wrong in that it would then incorrect?
@sidnath7336
@sidnath7336 Ай бұрын
If different tools require different key word arguments, how can these be passed in for the agent to access?
@developer-h6e
@developer-h6e 22 күн бұрын
is it possible to make agent that when provide with few hundred links extracts info in all links and store it
@serychristianrenaud
@serychristianrenaud Ай бұрын
thanks
@hari8568
@hari8568 Ай бұрын
Is there an elegant way to handle recursion errors?
@jamie_SF
@jamie_SF Ай бұрын
Awesome
@johnrogers3315
@johnrogers3315 Ай бұрын
Great tutorial, thank you
@ephimp3189
@ephimp3189 Ай бұрын
Is it possible to add a "fact checker" method? What if the answer is obtained from a document that gives false information? it would technically answer the question, just not be true
@aiamfree
@aiamfree Ай бұрын
it's sooooo fast!
@ghostwhowalks2324
@ghostwhowalks2324 Ай бұрын
amazing stuff which can be done with few lines of code. disruption coming everywhere
@HarmonySolo
@HarmonySolo Ай бұрын
LangGraph is too complicated, you have to implement State, Node etc. I would prefer to implement the Agent workflow by myself, which is mush easier at least I do not need to learn how to use LangGraph
@generatiacloud
@generatiacloud 24 күн бұрын
Any repo to share?
@RazorCXTechnologies
@RazorCXTechnologies 23 күн бұрын
Excellent tutorial! Another easier option is to use n8n instead because it has Langchain integration with AI agents built in and almost no code required to achieve same functionality. N8n also has automatic chatbot interface and webhooks.
@kgro353
@kgro353 14 күн бұрын
langflow is best solution
@_Learn_With_Me_EraofAI
@_Learn_With_Me_EraofAI Ай бұрын
Unable to access chat ollama
@HELLODIMENSION
@HELLODIMENSION Ай бұрын
You have no idea how much u saved me 😂 salute 🫡 thank u.
Reliable, fully local RAG agents with LLaMA3
21:19
LangChain
Рет қаралды 117 М.
Unlimited AI Agents running locally with Ollama & AnythingLLM
15:21
Tim Carambat
Рет қаралды 154 М.
小路飞还不知道他把路飞给擦没有了 #路飞#海贼王
00:32
路飞与唐舞桐
Рет қаралды 84 МЛН
Motorbike Smashes Into Porsche! 😱
00:15
Caters Clips
Рет қаралды 23 МЛН
Types of AI: Narrow, General, and Superintelligent
10:16
Ian Ochieng AI
Рет қаралды 10
Build a Customer Support Bot | LangGraph
47:08
LangChain
Рет қаралды 47 М.
host ALL your AI locally
24:20
NetworkChuck
Рет қаралды 1,3 МЛН
Harvard Presents NEW Knowledge-Graph AGENT (MedAI)
38:36
Discover AI
Рет қаралды 71 М.
Conceptual Guide: Multi Agent Architectures
8:58
LangChain
Рет қаралды 23 М.
This RAG AI Agent with n8n + Supabase is the Real Deal
16:27
Cole Medin
Рет қаралды 73 М.
LightRAG: A More Efficient Solution than GraphRAG for RAG Systems?
19:49
Prompt Engineering
Рет қаралды 33 М.
Using Ollama and N8N for AI Automation
13:43
Matt Williams
Рет қаралды 35 М.
小路飞还不知道他把路飞给擦没有了 #路飞#海贼王
00:32
路飞与唐舞桐
Рет қаралды 84 МЛН