Advanced RAG 01: Small-to-Big Retrieval with LlamaIndex

  Рет қаралды 11,468

Sophia Yang

Sophia Yang

7 ай бұрын

Two primary techniques:
- Smaller Child Chunks Referring to Bigger Parent Chunks: Fetch smaller chunks during retrieval first, then reference the parent IDs, and return the bigger chunks.
- Sentence Window Retrieval: Fetch a single sentence during retrieval and return a window of text around the sentence.
🔗 Blog: / advanced-rag-01-small-...
🔗 Notebook: colab.research.google.com/git...
🔔 SUBSCRIBE to my channel: kzbin.info...
⭐ Stay in touch ⭐
📚 DS/ML Book Club: dsbookclub.github.io/
▶ KZbin: / sophiayangds
✍️ Medium: / sophiamyang
🐦 Twitter: / sophiamyang
🤝 Linkedin: / sophiamyang
💚 #ai

Пікірлер: 32
@danieltost
@danieltost 7 ай бұрын
Loved the small to big retrieval. Simple and elegant. Awesome work Sophia!!
@sivi3883
@sivi3883 7 ай бұрын
Thanks for sharing! Cannot wait to try this and see the difference in the quality of RAG output. Excited to hear more tricks in this series!!!
@assouliyoussef9912
@assouliyoussef9912 Ай бұрын
just brilliant , thank you
@MakeStuffWithAI
@MakeStuffWithAI 7 ай бұрын
Awesome tutorial!
@whlau6191
@whlau6191 7 ай бұрын
Sophia, thanks for your sharing! Your presentation is always clear, structural and target oriented. it is so nice that you are one of the speakers in HK pycon. See you! 😊
@JamesRBentley
@JamesRBentley 7 ай бұрын
Another great video👍
@olliegooding
@olliegooding Ай бұрын
great explanations!
@chukypedro818
@chukypedro818 7 ай бұрын
Awesome, I've tried your approach to RAG, and it is far better than the current Langchain RAG that I am using. You are the best, Sophia.😘
@TheCherryo
@TheCherryo 7 ай бұрын
I thought of something similar to this but never had the time to implement it myself. I am very excited to see all the advancements in RAG and Agent-like applications. Curious to see if there is going to be specialized models that work better with RAG applications.
@klaudioz_
@klaudioz_ 7 ай бұрын
Amazing content!!. Thanks for sharing 🚀
@SophiaYangDS
@SophiaYangDS 7 ай бұрын
Thank you 🙏😊
@ztq121121
@ztq121121 7 ай бұрын
Thank you for sharing this! I'm curious if there are any benchmarks or evaluations that quantify the extent of improvement that small-to-big retrieval can offer. Is there any research paper that we can use as reference?
@AkarshanBiswas
@AkarshanBiswas 7 ай бұрын
Actually I do not use llama_index. I have created a separate class which does cosine similarity search for my local LLM's internet search feature.. This small chunks big chunks method looks very interesting. I will watch this video again tomorrow. Very good information!! Thank you.
@SophiaYangDS
@SophiaYangDS 7 ай бұрын
Nice! Thanks! Yeah the small to big retrieval methods are useful 🙌
@eklok5000
@eklok5000 7 ай бұрын
Really interesting content! I am unfortunately not all that familiar with llms and those techniques so it is sometimes hard to follow. Do you have some resources to catch up on the fundamentals?
@crowofhermes
@crowofhermes 6 ай бұрын
Hi first of all, thank you for the great explanations! may I ask how does the first recursive retriever differs from utilizing `HierarchicalNodeParser + AutoMergingRetriever`? I will check in detail to see if there is something else but they seem pretty similar
@dhavalthakkar5147
@dhavalthakkar5147 7 ай бұрын
Sophia, is there any langchain implementation for the above techniques?
@amitk3098
@amitk3098 2 ай бұрын
Is there anyway I can implement this without langchain and llama-index? I am not in favor having my RAG solution dependent on them.
@avotchiezoa6627
@avotchiezoa6627 7 ай бұрын
where can i find your background video ? Which apps do you use for power point?
@user-ow4kc4fo4c
@user-ow4kc4fo4c 7 ай бұрын
How to compare with graph llama index?
@AbolfazlMahmoodi
@AbolfazlMahmoodi 7 ай бұрын
How to reduce cost of openai API call?
@VenkatesanVenkat-fd4hg
@VenkatesanVenkat-fd4hg 7 ай бұрын
Superr video, can you discuss multimodal retriever system where docs or pdf contains tables & text.
@SophiaYangDS
@SophiaYangDS 7 ай бұрын
Yes thanks for the suggestions. These are on my list 🙌
@liju7024
@liju7024 7 ай бұрын
I still don't understand the connection between child chunck and parent chunck. Can you explain more about it?
@sjsadjf7030
@sjsadjf7030 2 ай бұрын
Amazing video but I have a question. I would be very happy if you could respond; even though it enhances the accuracy a lot, recursive retriever takes so much time. Would persisting indexes to the disk also be a solution in this case? or is there an alternative way to reduce the time? Thank you very much!!
@sjsadjf7030
@sjsadjf7030 2 ай бұрын
oh, when I switched the embed model to default (ada-002-v2) it is now much faster, now I understand that it is not recursion related lol :D the main question has changed to how to reduce the embedding model costs haha
@Aldraz
@Aldraz 7 ай бұрын
There are times when some new AI technique comes up and I am like yay.. but then I realize it's something I had implemented a year ago xD
@user-wr4yl7tx3w
@user-wr4yl7tx3w 7 ай бұрын
Is this idea associated with a publication?
@oguzhanzobar1094
@oguzhanzobar1094 7 ай бұрын
looks pretty cool, but I keep getting a ValueError: "Got a larger chunk overlap (200) than chunk size (128), should be smaller." when I try to run the code under "Chunk References: Smaller Child Chunks Referring to Bigger Parent Chunk".
@nathanredin7785
@nathanredin7785 6 ай бұрын
add ",chunk_overlap=20" in your splitter
Embeddings - EXPLAINED!
12:58
CodeEmporium
Рет қаралды 5 М.
La revancha 😱
00:55
Juan De Dios Pantoja 2
Рет қаралды 63 МЛН
Super gymnastics 😍🫣
00:15
Lexa_Merin
Рет қаралды 100 МЛН
Which one is the best? #katebrush #shorts
00:12
Kate Brush
Рет қаралды 23 МЛН
Пробую самое сладкое вещество во Вселенной
00:41
Кушать Хочу
Рет қаралды 4,1 МЛН
How to Improve LLMs with RAG (Overview + Python Code)
21:41
Shaw Talebi
Рет қаралды 25 М.
LangChain - Advanced RAG Techniques for better Retrieval Performance
24:57
Coding Crashcourses
Рет қаралды 24 М.
LangGraph 101: it's better than LangChain
32:26
James Briggs
Рет қаралды 49 М.
Advanced RAG Techniques
5:06
Mosleh Mahamud
Рет қаралды 2,6 М.
LlamaIndex overview & use cases | LangChain integration
12:36
Sophia Yang
Рет қаралды 22 М.
RAG from the Ground Up with Python and Ollama
15:32
Decoder
Рет қаралды 23 М.
I wish every AI Engineer could watch this.
33:49
1littlecoder
Рет қаралды 51 М.
Advanced RAG Techniques with @LlamaIndex
48:35
Timescale
Рет қаралды 1,9 М.
LlamaIndex Sessions: 12 RAG Pain Points and Solutions
37:57
LlamaIndex
Рет қаралды 11 М.
Adding Agentic Layers to RAG
19:40
AI User Group
Рет қаралды 14 М.
La revancha 😱
00:55
Juan De Dios Pantoja 2
Рет қаралды 63 МЛН