Thanks so much! You are the best channel when it comes to RAG. Please keep informing us about the latest advancements in this field. I have played with GraphRag. It can be expensive if you have tons of data, but considering how cheap GPT-4o and GTP-4o-mini have become, the price is not the biggest concern at least for my use case. I processes more than a thousand page document with GPT-4o and it cost me cents. The biggest problem with MS GraphRag is the inference latency. It is not very practical if you want to build a chatbot based on this. Also, it is less customizable in my recent experience. Hope LightRag is better in terms of accuracy, customizability and inference speed.
@AndrewNanton5 күн бұрын
Pretty cool - if you do more with this, I'd love to see some experiements combining this with late chunking
@engineerprompt5 күн бұрын
good idea, when I get time, I want to implement a system that combines all these different approaches together.
@hiranga3 күн бұрын
@@engineerprompt Has anyone recreated this in LangGraph or LangChain JS ?
@johnkintree7634 күн бұрын
It looks like the pipeline is modular, and could work with different vector and graph databases. In the future, users could rate the responses, so highly rated responses could be stored and retrieved when similar queries are made by other users. Retrieving highly rated responses could improve overall system performance compared with generating responses for every query. Apply fast thinking instead of slow thinking when possible.
@yvescourtoisКүн бұрын
Great stuff, well done! I don't see any reason to use GraphRAG any more after that. I guess the technology will continue to surprise us, but the cost argument is powerful when you handle tons of data
@CAPSLOCK_USER5 күн бұрын
such a great channel, thanks for this guide, i was just about to implement a knowledge base!
@dawid_dahl4 күн бұрын
In a few years I suspect we will laugh at all these hacky RAG implementations as one will simply be able to dump everything into the context window and there will be some native mechanism to handle efficiency. What do you think?
@adegboyegaajenifuja12744 күн бұрын
Or you just grant access to your document repositories and you're good to go
4 күн бұрын
Needle in haystack problem of big context Windows....
@BryantAvila3 күн бұрын
To dump 10K documents each composed of at least 100 pages seems it will always be unrealistic. RAG of some sort will still be needed.
@dawid_dahl2 күн бұрын
@@BryantAvila Will be interesting to see how this comment ages over the comment years. (Or mine!)
@JoseAntonio-sn6sf13 сағат бұрын
nice video, I am just starting with RAGs, so sorry if my question is a little stupid, but if you spend 80k tokens running LightRAG, why the necessity to even implement a RAG when the book it self has 40k tokens? I mean wouldn't be easier to send the whole book to chatgpt?
@ahmadzaimhilmi4 күн бұрын
It costs me $0.02 per 10 page pdf document using openai api. I think it's pretty decent. Future improvements should also include storing all these json files in a database. Also, have you figured out how to get the reference chunks used to generate the response to the query?
@penguinmagehello3 күн бұрын
Mind sharing how/ which packages you use to split pdfs especially those with images?
@ahmadzaimhilmi3 күн бұрын
@@penguinmagehello google marker pdf to markdown
@trytry65695 күн бұрын
Please please please make a video in which we can use it with our local models.
@engineerprompt5 күн бұрын
on it :)
@trytry65695 күн бұрын
@@engineerprompt Thanks man🙌
@johnkintree7634 күн бұрын
@@engineerpromptThe Zamba2 family of models looks interesting for running locally with lower latency and more tokens/sec output.
@guscastilloa4 күн бұрын
How do you record your screen so that it follows your cursos? It’s super useful to follow along specially when reading papers! Kudos on this amazing expose of this methodology!
@dylanmoraes9902 күн бұрын
It's a video editing app called screen studio on mac
@RickySupriyadi5 күн бұрын
wow so many RAG system this year already
@engineerprompt5 күн бұрын
Yup, its hard to track but nice to see the different ideas that are coming up.
@ai_dart5 күн бұрын
Good Info
@mclachan2 күн бұрын
Have you tested it on structured data?
@muhammadshafiqsafian61494 күн бұрын
how can i view the algo flowchart? a bit complex to understand
@remusomega4 күн бұрын
Graphs solve the problem of chunk embeddings being de-contextualized. Late Chunking solves this problem. I think we need to re-consider the use cases for GraphRags.
@syedsaifullahtarique4 күн бұрын
How to create LightRAG object inside dicken folder
@dawid_dahl4 күн бұрын
Thanks for the really great video, by the way.
@ashgtd5 күн бұрын
great video thank you
@axelwehmeyer95995 күн бұрын
cool. Is there an implementation for Groq-API like gpt_4o_mini_complete-API in LightRAG? How can i use a GUI-Chatbot for LightRAG, e.g. chainlit/streamlit/...? thx
@engineerprompt5 күн бұрын
I think there are a couple of PRs for other models. Not sure about the GUI
@ingenierofelipeurreg5 күн бұрын
Which of all RAG is cost effective and quality better?
@engineerprompt5 күн бұрын
Its a hard question to answer. It will really depend on the task at hands. If you think its just search/looking up information, may be a standard RAG. If there are relationships between objects/entities then may be a knowledge graph.
@Jvo_Rien5 күн бұрын
thank you :)
@AhmedMagdy-ly3ng2 күн бұрын
Hey 👋 I'm one of the most exciting fans of you and I have the opportunity to come to India I wish I can see you and have a conversation.. Please 🥺🙏
@akshatgandhi79585 күн бұрын
Thanks
@themax2go5 күн бұрын
so essentially they implemented triples / triplets (sciphi/triplex) 🤔