How to set up RAG - Retrieval Augmented Generation (demo)

41,789 views

Don Woodlock

A day ago

Comments: 117
@AdityaSikaria-ig3rw 4 days ago
The code walkthrough of exactly how this works is amazing. The clearest and most concise I have seen so far.
@davidgarces3584 3 months ago
I have been doing dozens of trainings and courses in the past months and I have to say this was the absolute best deep dive into RAG!!! I loved how you showed what this looks like in code. Most explanations are completely theoretical. This was amazing! Please continue to post videos on (gen)AI!!!! I personally would love to understand how a system (CRM, etc.) is integrated in this architecture. Anybody else interested in this? Please like!
@frack4oil16 6 months ago
I’m a cybersecurity professional but I don’t code. More like an infrastructure guy. But now I’ve become very interested in AI security. Coding was always daunting for me. This guy made it look easy as pie. Thank you!
@jathinjagannath 11 days ago
Awesome demo, nothing beats a good hands-on, real-world example!
@funmitoadeyemi4647 8 months ago
You really are a teacher, made it consumable to RAG newbies, this is how videos should be lol. Thank you very much
@belugaaa6649 8 months ago
Awesome Don, please do more of these videos explaining AI/ML. Next topic: one on LangChain or the Llama 3 model, which was released recently.
@dwoodlock 7 months ago
Yes - I'll take a look.
@belugaaa6649 7 months ago
@dwoodlock Thank you, Don, you are amazing
@HarmonyWang-u9j 2 months ago
This is so clear! I am a beginner who was able to learn the whole process with your video, thank you so much!
@odysseuspolynoos9335 2 months ago
You are just great, as you know your subject and you have the will to explain it to us. Many thanks and all the best!
@govindarajram8553 7 months ago
It's so good to see a good demo, please post more videos. I am specifically looking for how to go from an RDBMS to a vector database to store embeddings so my users can use AI.
@dwoodlock 6 months ago
Try InterSystems IRIS.
@arvindms9 3 months ago
This was one of the best videos on RAG. Very nicely done!
@nadellaella6416 8 months ago
The way you conveyed that information was so structured and informative. Your explanation made me grasp the concept quickly. Thank You!!!
@sanjeebsahoo6145 19 days ago
Amazing video. Thank you, sir.
@applegaming3461 19 days ago
Awesome! Well explained 👍
@akashacharya2813 6 months ago
I am a full-stack developer trying to get into the machine learning/AI industry. Your videos are motivation for people like us.
@starter_culture 7 months ago
Best explanation I have seen to date of how RAG works.
@joeldunand5759 29 days ago
It is so well put together and presented. Thank you so much :)
@nayanchoudhary4353 5 months ago
I had been looking for the full set of steps for a year now. This is the easiest and a very good video! Thank you for teaching such an important use of LLMs!
@OstaNazir 8 months ago
I loved this explanation of RAG. I’ll share this with my team.
@swethaanil5472 5 months ago
I tried the approach with a product catalog dataset and was able to get similar results. You are a wonderful teacher. Thank you so much for this video and for the education. Looking forward to the next demo.
@sandyna1k 4 months ago
How did you set it up? I tried to do what he showed in the video but cannot get past the initial errors. For example: OpenAI not found.
@aditiajaykumar6381 3 months ago
What website did you use and how did you download it? Any suggestions?
@fitnessismypassion 5 months ago
Amazing job Don! Subscribed as soon as I saw your first video about RAG, and this one is even greater. Thank you very much, please keep making videos. You have a talent for teaching a lay audience.
@dbose_ai 7 months ago
Unbelievable content. Couldn't have been any simpler than this. Thank you so much.
@nomorecramps 6 months ago
This is awesome! Loved the explanation and walkthrough. Definitely sharing this with all the folks I come across who'd need help with this!
@altruistization 16 days ago
I would have loved to see you emphasize the fact that LLM is being used as an agent and that that’s what makes Agentic AI so powerful. I wonder why creators of videos like these don’t stress or repeat that key aspect when trying to explain a new concept!
@dwoodlock a day ago
I have to do one on agentic AI soon.
@GiantNeuron 4 months ago
Very useful! Amazing explanation!
@penpen9379 3 months ago
Amazing video! Thanks Don.
@boseashish 3 months ago
Beautiful detailing. Thank you for a kind video.
@MohitGupta-cz9ok 7 months ago
Thanks, after watching your video I got a clear understanding of how we can implement RAG.
@parzivalplayer9636 5 months ago
Fantastic stuff here. I hope you continue to explain complex topics.
@jacksonhanson154 8 months ago
Very thankful Don. It would be awesome to get some timestamps throughout the video, but this is incredible as is.
@dwoodlock 7 months ago
Ah. Good idea.
@dwoodlock 7 months ago
Done!
@jacksonhanson154 7 months ago
@dwoodlock Awesome
@HB-dj6eq 5 months ago
Excellent presentation, well done. The code really helped.
@MWileY-nj1yb 6 months ago
I appreciate you immensely! Awesome concise clear information. I really hope you continue to make available content like this.
@SantK1208 6 months ago
You made my demo remarkable, I explained it to my audience easily ❤❤❤❤❤❤
@bhaskarmazumdar9478 8 months ago
Appreciate you taking the time to get into the weeds
@arivolis 6 months ago
Great demo and explanation. Thank you for sharing the knowledge!
@hemantgaikwad3352 5 months ago
Good content. Thank you. Looking forward to next video.
@damnguyenduyan 7 months ago
Amazing video, saved me tons of days of study. Thank you very much, Mr. Don
@chetanpdeshpande3439 8 months ago
This was fantastic! Thank you!
@juanluisesteban7394 3 months ago
Really good. Thanks!
@secondsleep 6 months ago
Great video, easy to follow, highly informative. Thanks!
@yogeshsuryamurthy5939 8 months ago
Many thanks, Don. Your explanation was simple, clear and very helpful. To ensure that the top 5 answers are more relevant to the prompt questions, how can we verify and validate the model's output? How can we test for bias in the data being prepared using the CSV, which is necessary to obtain the top answers? My other question is, how can we integrate these models in real time? That is, how do we integrate these models into different HIS and CIS touchpoints (or any healthcare system, really), how do we run and train these models, what additional computation (hardware requirements and increase in total cost of ownership) might be needed, and how do we confirm User Acceptance Tests and seek customer sign-off before moving to production?
@dwoodlock 7 months ago
In terms of validation, you will need to do some of this by hand - meaning a human reading the answers and verifying that they are accurate, that they represent the right tone, and that your model isn't responding to things that are off topic and/or are dangerous. These 3 could be separate validation threads.
@yogeshsuryamurthy5939 7 months ago
@dwoodlock Thanks Don, very helpful.
@LGEightZero 4 months ago
Thanks for this. How would this differ if it were not web-based public data, but actual patient data, for example? How is access to permission-based sensitive data handled in this case?
@souravmazumder58 5 months ago
Thanks for the video. Is there a way I can understand how the costing works when I'm using RAG?
@sjmediaonline 7 months ago
Great video! Could you please do one video on RAG with knowledge graph? Thanks!
@BKNaveen 7 months ago
Great explanation! Is there a way I can get the code you show in the Jupyter notebook, please? I would like to play around with it using my dataset. That would be very helpful!
@porter.anderson11 6 months ago
Were you able to find the code shown in Jupyter or something similar?
@aanyaaryancoolgamers2013 7 months ago
Great video! Where can I get the code used in the video?
@prakashshekade 6 months ago
Amazing, Don. I wonder why there is a need to use LangChain. Can you please shed some light?
@Themojii 7 months ago
Hi Don, thank you for the great video. It would be great if you could share the notebook codes 😊
@The_Reductionist 7 months ago
How did you turn the website into a CSV file? A web scraper? If so, which one?
@dwoodlock 6 months ago
I wrote it myself using the Beautiful Soup library in Python. It wasn't too complicated.
@ajaykumarsugumaran3740 4 months ago
Great video. Could you explain when would we use RAGs? Is it supposed to be used almost always with an LLM? Or RAGs have special use cases?
@fazilkagdi3290 8 months ago
Always wanted to understand how everyone all of a sudden made their own bots. Now I understand how simple it is.
@JI9131 a month ago
Thanks for the video! It’s great to see how the RAG really works. I understand that RAG is quite good at organizing answers with text. How about replacing the CSV files with large Excel sheets and needing answers from different columns in the same row? Will the RAG perform well in that case? I find that while the response is often correct, sometimes it jumps to different rows and makes up answers. What is your take on using Excel sheets as documents for RAG?
@mariostudio7 17 hours ago
How do we obtain the source data, and can we read/extract data from PDF files the same way?
@raghureddy1746 3 months ago
Thank you Thank you!!!!
@vladd.9264 5 months ago
Hi Don, I am learning this now and also building an assistant based on given material, like the healthcare website in your case, so I watched this RAG video to learn the concept. My question: is it really necessary to do the embedding and calculate the vector distances, as opposed to just uploading all the content (the website text) to storage and letting the LLM use it through a prompt? In that case the engine should basically do the same distance search, maybe cutting the material into smaller pieces than were uploaded (?), and finally stick to the least distant pieces for the answer. Do you think your more manual approach can generate better answers?
@peterezkka 8 months ago
Love it, thank you! I appreciate these videos.
@cj5214 3 months ago
This is my first video on AI. I am just trying this sample code and it keeps asking for an API key. How can I manage that?
@CalvHobbes 6 months ago
Thank you, this was one of the most useful hands on tutorials and really cleared the confusion. I wonder if you plan to share the Jupyter notebook you used and if there is an upcoming (or existing) video that address the step you took to gather the input data (i.e. scraping the website to create the csv). It would be so useful to know about techniques for data retrieval and storage from bigger sites/ other data sources, and what would be different if you didn't have the liberty to only pick the top 4 closest embeddings. E.g. what if it were a retail web site and the user asked "what dresses do you have in blue?". Thank you once again.
@dwoodlock 6 months ago
I've had a few questions on the 'scraping the website' step. I basically just wrote a Python script, using a few libraries, that navigated through the website from the top and stored the documents.
@CalvHobbes 6 months ago
Thank you
@mohammadnour8114 6 months ago
Can you share the notebook and the dataset?
@ajaykumarsugumaran3740 4 months ago
Excellent Don, the video explained RAG as neatly as possible! 1. Is it possible to share the code, the web source, and the web data CSV file so we can follow along for practice? 2. The OpenAI model used here for embedding: did you have to pay to use it? 3. Could you do a separate video on current AI challenges related to AI ethics and ways to tackle them? E.g. AI used to create spam/fake profiles, generate non-consensual adult images, etc.
@sandyna1k 4 months ago
I have the same questions. Did you find a resolution?
@ajaykumarsugumaran3740 4 months ago
@sandyna1k 1. You can try web scraping or sites like httracker. 2. Looks like it's a paid version of the LLM. 3. Haven't found an answer yet.
@appa9450 6 months ago
My question is, say you want to make a chatbot that gives responses to questions about workflow and how to do certain things within a company. Would it be possible to connect it to a database that holds lots of documentation about a company?
@dwoodlock 6 months ago
Yes certainly. With the approach I outlined, you can connect to documents within a database. You would just need to store the embeddings into the database itself, or a separate database that supports that.
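A minimal sketch of the "store the embeddings in the database itself" idea, assuming SQLite and toy hand-written vectors in place of a real embedding model. The table name `docs` and the helper names are hypothetical, and a production setup would more likely use a database with native vector search than this linear scan.

```python
import json
import math
import sqlite3


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def create_store(conn):
    # Hypothetical schema: one row per document, embedding kept as JSON text.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs ("
        "id INTEGER PRIMARY KEY, text TEXT NOT NULL, embedding TEXT NOT NULL)"
    )


def add_doc(conn, text, embedding):
    conn.execute(
        "INSERT INTO docs (text, embedding) VALUES (?, ?)",
        (text, json.dumps(embedding)),
    )


def top_k(conn, query_embedding, k=4):
    """Score every stored document against the query and return the k best."""
    scored = []
    for doc_id, text, emb in conn.execute("SELECT id, text, embedding FROM docs"):
        scored.append((cosine(query_embedding, json.loads(emb)), doc_id, text))
    scored.sort(reverse=True)
    return scored[:k]
```

In use, each document's embedding would come from the same model that embeds the user's question, so that the two vectors are comparable.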
@sdaiwepm 2 months ago
What a wonderful example! Before I run through this from my own Jupyter Notebook - do you recall roughly how much it cost in OpenAI credits?
@gridplan 6 months ago
Suppose I wanted the answer to cite its source(s) -- in your example, the web page(s) that went into the final answer. How would that change this approach? Presumably, I wouldn't want to encode the web page's URL in the same vector as the text as that would skew the similarity calculation (dot product).
@dwoodlock 6 months ago
Yes - you could put a reference in each of the documents that you choose and ask the LLM to return the references for the sections that it used. Given that they are cryptic (for example, you could replace the URLs with unique IDs), they wouldn't really skew the embeddings.
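One way the ID-based citation idea could look, as a sketch only: the IDs are attached when the prompt is built, so the URL never enters the embedded text. The `D1`-style ID format, the prompt wording, and both function names are assumptions for illustration, not the code from the video.

```python
import re


def build_prompt(question, docs):
    """Build a prompt from retrieved docs given as (doc_id, url, text) tuples.

    Only the short ID and the text go into the prompt; the URL stays outside.
    """
    context = "\n\n".join(f"[{doc_id}] {text}" for doc_id, _url, text in docs)
    return (
        "Answer the question using only the sources below. "
        "Cite the ID of each source you use in square brackets.\n\n"
        f"Sources:\n{context}\n\nQuestion: {question}"
    )


def resolve_citations(answer, docs):
    """Map the IDs cited in the model's answer back to their URLs."""
    url_by_id = {doc_id: url for doc_id, url, _text in docs}
    cited = re.findall(r"\[(D\d+)\]", answer)
    # dict.fromkeys removes repeats while keeping first-seen order.
    return [url_by_id[c] for c in dict.fromkeys(cited) if c in url_by_id]
```

IDs the model invents are simply dropped by `resolve_citations`, which is a cheap guard against made-up references.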
@gridplan 6 months ago
@dwoodlock That makes sense, especially if I use IDs rather than URLs like you're suggesting. Thank you!
@ionuta.5359 7 months ago
Simple and clear! Thanks!
@sanjaysitaramsoni 7 months ago
I have really fallen in love with RAG. I am a Master's student and I want to do some research on RAG for my upcoming thesis project. Can I have some suggestions from you? Articles or publications? Thanks for this video
@bullsbearsfight 6 months ago
Awesome explanation
@asheeshmathur 7 months ago
Excellent, made so simple.😃
@divyabhasin1379 7 months ago
Great video, very well explained. Do you also have a video on how to download a website into a CSV?
@dwoodlock 6 months ago
Not yet. I may do one - lots of you have asked.
@jdcage 7 months ago
Great video! How did you get the content of the website? I am working on a similar project, and most of my effort seems to be scraping pages.
@gHosT_scovoxwZ 4 months ago
Can you please share a link to the CSV file, so that I can practice setting this up?
@sibims653 8 months ago
Great explanation
@Sanu-pl8is 7 months ago
How can we use a CSV file with multiple fields, given that the prompt may require evaluating all fields to find the best answer?
@dwoodlock 6 months ago
Most people will turn the multiple fields into a sentence or a paragraph. Then you have a document.
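The "fields into a sentence" step can be sketched roughly as follows, assuming the CSV has a header row. The column names in the example and the `field: value` phrasing are hypothetical; any wording that keeps every field of a row together in one document would serve.

```python
import csv
import io


def row_to_document(row):
    """Flatten one CSV row (a dict) into a single sentence-like string.

    Empty fields are skipped so they add no noise to the embedding.
    """
    parts = [
        f"{field.replace('_', ' ')}: {value.strip()}"
        for field, value in row.items()
        if value and value.strip()
    ]
    return ". ".join(parts) + "."


def csv_to_documents(csv_text):
    """Turn every data row of a CSV string into one document string."""
    return [row_to_document(r) for r in csv.DictReader(io.StringIO(csv_text))]
```

For example, a row with `name=Summer dress`, `color=blue`, `price_usd=49` becomes `name: Summer dress. color: blue. price usd: 49.`, which is then embedded like any other document.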
@maa1dz1333q2eqER 5 months ago
Great job, thanks!!!!
@fernandes1431 6 months ago
How do you get the webpage dataset?
@dwoodlock 6 months ago
I wrote a little Python script that scraped the website by navigating it from the top, like a tree-based search, and stored the text of each page.
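A rough sketch of what such a crawl might look like; the actual script was not shared. It assumes a breadth-first walk that stays on the starting host, and it swaps in the standard library's `html.parser` for Beautiful Soup to stay dependency-free. The `fetch` argument is a hypothetical callable that returns a page's HTML for a URL, so the download method is left open.

```python
import csv
import io
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class PageParser(HTMLParser):
    """Collects visible text and link targets from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []
        self._skip = 0  # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.text_parts.append(data.strip())


def crawl(start_url, fetch, max_pages=100):
    """Walk the site from start_url, one row of text per page visited."""
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = deque([start_url])
    rows = []
    while queue and len(rows) < max_pages:
        url = queue.popleft()
        parser = PageParser()
        parser.feed(fetch(url))
        rows.append({"url": url, "text": " ".join(parser.text_parts)})
        for href in parser.links:
            link = urljoin(url, href).split("#")[0]  # resolve, drop fragment
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return rows


def to_csv(rows):
    """Serialize crawled rows as CSV text with url and text columns."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["url", "text"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

A real run would pass a `fetch` that downloads pages, and should respect the site's robots.txt and rate limits.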
@mdtanbeerjubaer584 4 months ago
Can you please share the CSV file so that I can practice? I am a new researcher and will learn this for my thesis.
@mzimmerman1988 7 months ago
Helpful! Thanks.
@pptmtz 6 months ago
Thanks, this is so cool
@nabinbhusalofficial 6 months ago
Very helpful. Can you please make a video on RAG for a low-resource language like Nepali?
@pjali1 7 months ago
Awesome, sir!!
@baladin06 2 months ago
Thanks!
@ernestboakyedanquah3661 27 days ago
Can we have the code for this demo?
@Tessi42 7 months ago
Wow, thank you
@jehnzcode 6 months ago
👏 new subscriber here
@050zaher 5 months ago
You are great
@sohans3550 6 months ago
Is it possible to share the CSV file, Don?
@rahuln3977 6 months ago
Hi sir, can I get this code to replicate in my Jupyter notebook?
@paultoensing3126 7 months ago
So what is df?
@agusriyadi6802 7 months ago
Distance Formula I think
@dwoodlock 6 months ago
The primary data structure used in data science is called a Data Frame. It's just like a SQL table. The rows are typically instances and the columns are typically attributes/fields/properties of each instance. Hence the variable df.
@matiasgarces2465 6 months ago
For dummies please!!! Step by step
@paultoensing3126 7 months ago
What is concatenate?
@shiyabrezel 5 months ago
That’s