A variation of this approach would be to pass the response from ChatGPT to a fine-tuned text to speech(TTS) model(say AWS Polly) to mimic Gaurav's voice, intonation and delivery style. Learner's come for Gaurav's teaching style and this would greatly enhance the user experience. One other extension would be to make this real time. Where the learners can pause the lecture at any point and ask a question contextually at that point. This would mimic the experience of a live class where the student can get his query solved interactively.
@akshaychavan55112 ай бұрын
Just what I was looking for!! Building a project with similar specification. This video helped me understand the system design aspect of it.
@gkcs2 ай бұрын
Cheers!
@naveenthota6429Ай бұрын
One of the best videos I saw in recent times. Great details. Thank you and God bless you.
@gkcsАй бұрын
Thank you!
@AnishGeorge-b2v10 күн бұрын
Awesome content. I have been searching for this content for the past one month and just found it 😂. And it is only 13 minutes?. Straight to the point. Thank you. God Bless.
@gkcs10 күн бұрын
Thank you!
@Condinginsight2 ай бұрын
wa..o... superb!! keep your smart work like that, waiting for your next video.
@gkcs2 ай бұрын
Thank you for watching! Resource links: Neon DB - fyi.neon.tech/gs1 AWS Transcribe: aws.amazon.com/pm/transcribe
@sachinawati63322 ай бұрын
create a AI teacher using swarm agents, like it should personal ai teacher for all subjects with proper accuracy and according to my knowledge modify the content my level of understanding format type
@saranrajiyyanar3339Ай бұрын
Please share your GitHub repo link, How to join you class?
@lifebeckons-wo1ry12 күн бұрын
Great video and explaination. What is the best tool to transcribe videos. I have a similar use/business case for B2C.
@MrYtkrishna22 күн бұрын
Bro just gave someone a job!! Well done 😎
@alixaprodev2 ай бұрын
I had this challenge previously. Thank you os much for this.
@gkcs2 ай бұрын
Cheers!
@haryanvi__nomadКүн бұрын
Nice Video Gaurav, would love to see a hands on demo.
@IamSinghJaskaran2 ай бұрын
Great video! I’ve been looking for something like this. I’m currently working on the concept of vector embeddings and building a project around it, but I’ve been struggling to come up with a clear use case for myself or visualize a real-life application-until now. This really helped!
@gkcs2 ай бұрын
That's awesome, good luck with your project!
@pritampaul8167Ай бұрын
A very insightful video! Thanks Gaurav
@manuelblinkert3200Ай бұрын
Very good conceptual video about a RAG pipeline. well done.
@gkcsАй бұрын
Thank you!
@elangoy2 ай бұрын
Keep UP the GOOD WORK , your videos are very good and of good quality
@gkcs2 ай бұрын
Thank you!
@ramanathTVАй бұрын
Extremely well explained , Loved it ❤
@gkcsАй бұрын
Thank you!
@surajsrinivas75172 ай бұрын
Very cool and concise video. Was not aware of this and thanks for sharing.
@gkcs2 ай бұрын
Glad it was helpful!
@66ravirk4 күн бұрын
Very insightful
@RahulsapkotaАй бұрын
Really insightful loved it. I heard lot of rag everywhere now I have a good idea on this thanks
@RahulsapkotaАй бұрын
Can we get a video of what is agents how to get started on that just a brief video like this would be really helpful
@towardsthenorth2022Ай бұрын
Great work Gaurav !
@gkcsАй бұрын
Thank you!
@shivprakashy2 ай бұрын
Very informative. Thanks for this. Throttled my network to read the tech bits on your website. Couldn’t find a Community menu, I think that will engage users. More users on website may lead to more conversion.
@gkcs2 ай бұрын
Hahaha, that's a nice way to read the tech bits 😁 I am looking to add a community section in 2025. Thanks for the feedback!
@PraghaniOther4 күн бұрын
Great one. Keep it up.
@VijayR-h2yАй бұрын
Thanks, nicely explained. Looking fwd to more such AI system designs.
@gkcsАй бұрын
Thanks, more are coming!
@pusarlaaishwarya50352 ай бұрын
Bro thankyou for this concept😊😊😊
@sumitbasu51462 ай бұрын
Thank you Gaurav 😁
@gkcs2 ай бұрын
Thank you!
@SuperGojeto2 ай бұрын
Amazing! I want to do a personal project based on nearly this concept but I want everything to be locally handled including the chatgpt answer generation part. Although it's not related to text but majorly images. Great video anyways.
@gkcs2 ай бұрын
Go for it! The generation is also possible if you have your own transformers.
@ANANDKUMAR-ji4mv2 ай бұрын
Thanks Gaurav
@gkcs2 ай бұрын
Cheers 😁
@MitaliNeerPatel2 ай бұрын
helpful, thank you.
@gkcs2 ай бұрын
Thank you :D
@thebibhutyАй бұрын
Comment unrelated to the video: Gaurav one request: Coming from someone who have completed >50% of interview ready system design course. I loved your system design but deep knowledge comes beyond 1 hour of the System Design interview for a specific topic. Ex: - you teach us how to design uber in 1 hour for a system design interview and that works till we get a job. - now as a curious engineer I want to do deep dive into Uber's say hexagonal mapping thing. What enhancement I am requesting is when you design a system design topic please divide into 2 parts: 1. for interview(you're already doing) 2. beyond interview(for extreme deep dives like research papers, blogs etc. related to point 1) I know you have a dedicated research paper discussion section, I am yet to go there but it would be great if we can extend it to the system design topics which you have already taught, it'll help us getting a holistic view. Regards From your longtime student
@gkcsАй бұрын
Thank you for the suggestion, I will think about how to make this a reality :) Until then, you can use this page to filter through great deep diving resources: interviewready.io/resources/
@VaibhavPatil-rx7pc18 күн бұрын
Great, thanks
@DevyaniArya082 ай бұрын
a muchhh needed video, thankss a lottt sirrr😭😭😭😭😭😭😭😭😭😭😭😭
@gkcs2 ай бұрын
Cheers!
@bulliethepirateАй бұрын
Thank you for this fantastic video. Very clear and I like the diagrams. One completely unrelated piece of advice: incorporate exercises that target your shoulders and neck. Not for getting bigger, but to improve your posture and enhance your overall well-being.
@gkcsАй бұрын
Thank you! Agreed, I'll work on it :D
@RaymaxOfficial2 ай бұрын
Hey Gaurav, excellent explanation as always. I have a couple of questions I wanted to ask: 1. How do you create a vector database? Does Neon DB assist in this process? Do we simply pass the transcript to Neon DB, and it returns the vector file for that transcript? 2. Is this what our request to ChatGPT looks like? -> "What is load balancing?" and here are some vector databases for context: transcript-vector.txt. 3. So, does ChatGPT receive the query from the user and the vector database for the transcript provided by Neon DB? 4. I also don't understand how Neon DB selects which transcript vector to send to ChatGPT. Thank you for the video.
@gkcs2 ай бұрын
Thanks Raymax! 1. Creating a database a one click operation in Neon. No we have to first get an embedding for the transcript using a model. There is LlamaIndex, etc... for this. 2. User prompt: What is chat gpt. System Prompt: You are a system design teacher who answers in 2-3 sentences. Something like the above. 3. ChatGPT talks to our server, the InterviewReady server. Our server finds files most relevant to a query from NeonDB and uses them to add context to the query. 4. It's a vector embedding search. There are various search algorithms for this, like K-nearest and HSNW.
@RaymaxOfficial2 ай бұрын
@@gkcs thankyou sir !! 🫡
@harinijaan37792 ай бұрын
Great video. Your humour "depending on your salary you can set up the instance you want ...xD"
@gkcs2 ай бұрын
😁
@goleshahaji25 күн бұрын
great walkthrough presentation. If I am not wrong, the neon database needs to store embeddings and not the plain text provided by aws transcribe.
@gkcs25 күн бұрын
Yes it stores the embeddings.
@thecoder75702 ай бұрын
Awesome 👍
@gkcs2 ай бұрын
Thank you!
@shubhamjagtap1082 ай бұрын
Thanks
@KaushikSharma-q2r5 күн бұрын
Good content ❤
@Guru-mf3ew2 ай бұрын
I really really like your videos
@gkcs2 ай бұрын
Thank you!
@anaroidd785112 күн бұрын
Great Video. But some finer details or a deep-dive part-2 might be helpful. In your demo, you manually added a system prompt with the files. But in a real chat scenario you would generate the system prompt after querying the Neon DB. Is that left as an exercise to the viewer?
@ash17942 ай бұрын
Another way for transcripts is to use youtube as well. It generates transcripts whenever a file is uploaded.
@gkcs2 ай бұрын
That's a great idea, thank you!
@siddhantota28572 ай бұрын
Ammmaazing boi... thanks
@gkcs2 ай бұрын
Cheers :D
@jahliveabiАй бұрын
It would be good to give more context on when to implement a vector db. Your explanation is brilliant and I will learn from it. Asking question to OpenAI already has multiple implementations of vector db right? So how does it benefit a certain use case?
@sourabhkhandelwal6892 ай бұрын
Great video, Gaurav. Also, from where do I get that cool tshirt?
@gkcs2 ай бұрын
Thanks! I got it as a gift, so I am not sure where. I think it's from a creator shop in Instagram though.
@indianatlargeАй бұрын
awesome vid.. i am your fan..
@Codersky012 ай бұрын
Waao very useful
@dhruvwills2 ай бұрын
So basically, You are storing raw files using openai files api, and storing vector embedding in neon vector db, when a user makes a prompt you fetch the similar vectors from the neondb and inject in prompt sent to openai text generation api. what if user deletes the file, we sure can delete it from openai files, but how do we delete the vector embedding of a particular file ?
@deesiInGermany2 ай бұрын
Too have the same doubt
@dhruvwills2 ай бұрын
@deesiInGermany i found that we have to attach metadata with the vector and when the file is deleted, we have to search embedding that have same name in metadata. I also wondered that if we cannot use uploaded files directly in the prompt like "refer file {file1}", than whats the point of storing it on openai server. We should store it on our server only for better flexibility in future.
@deesiInGermany2 ай бұрын
@@dhruvwills thanks Dhruv for your reply. I am starting my AI journey with a project. Just confused here if we are storing our files in Vector DB and then will only share the part of the context with the question to gpt, then what's a point of storing files in ChatGpt. Also storing theses files will consume tokens means extra cost
@dhruvwills2 ай бұрын
@@deesiInGermany the primary objective of files in gpt is to finetune model, but if you do it then it will generalize for each user. Thats is why we are providing the content in context, so in this case storing files on openai is just acting as a database, this is not necessary, you can store files on your own server and generate embedding using openai api, then store that embeddings in neondb. Here is how it would work, You would use openai api to send the text which was stored on the file and openai will return embedding which you will store on neon vector db. Earlier there used to be answers and query api in openai but that is now deprecated. They suggest to use embedding now. So basically, attach embedding in context, and you are good to go.
@deesiInGermany2 ай бұрын
@dhruvwills perfect Dhruv. I'll give it a try. Thank you
@satyabatigoswami17352 ай бұрын
Great ecplanation and thanks a lot for this concept!!!I have one doubt. Will the data in neon db be static? How are you storing the data in the db for a particular transcript?
@gkcs2 ай бұрын
Thanks! Yes the data is static for a transcript. I can update the vector embeddings by clearing the data and replacing all the vectors again.
@satyabatigoswami17352 ай бұрын
@@gkcs thanks a lot for the reply 😇
@m1cannasАй бұрын
❤ super
@maddymadanraj2 ай бұрын
hey gaurav can we also store images in vector database? example can i store youtube thumbnails ? and while prompting gpt can use those as reference and provide me a better outcome love your content 🙌🏽
@gkcs2 ай бұрын
Thanks! You can store the files. The images can be converted to vectors, yes. But OpenAI is not suitable for this type of embedding, by my limited understanding. A multimodal system will do better.
@maddymadanraj2 ай бұрын
@@gkcs thanks 👍
@RamanujaSvl9 күн бұрын
How are the context files generated? I understand that the transcription is done by AWS but for a given query how are files with relaxant context chosen? In other words what's the input to Neon DB? Please excuse me if my question doesn't make much sense totally new here. Keep up the good work 👏🏾
@shubhamrajput26672 ай бұрын
Hi Gaurav, did you consider elasticsearch as well? I mean just for benchmarks
@gkcs2 ай бұрын
I have used ElasticSearch (OpenSearch) earlier: it's very expensive and not as good as a vector DB like PGVector. The problem with text search is that it relies too much on written words instead of the context between them. Maybe I could have used the OpenSearch better, but my experience with Neon (PGVector) is better.
@shivambaghel96682 ай бұрын
people like me who do not have money or do not want to spend money on chatgpt ,can user ollama to run a LLM model on a local instance and run llama3, or mistral for RAG
@gkcs2 ай бұрын
Yes that's right.
@princezuko7073Ай бұрын
So which is which? I mean what is the vector database here? Llama3? And ollama is the llm model to which we will give the augmented query with context that we will find from the llama3?
@Polaar_b2 ай бұрын
1. Is Neon db hosted in your vpc? 2. How do you maintain files/ embedding in vector db (say- For another RAG project where users wants to chat with their documents then how do we query vector db where multiple users have their own documents)? 3. Do you think running llm instance locally (eg. ollama) is better option!!
@bonitokumar497722 күн бұрын
Wonderful architecture. One question , vector db seems to be equivalent to tag cloud in n dimensions. If I have to put tokens into n dimensions, I need tags and it's values for n axis. How do I arrive at values ?
@devrelsquad2 ай бұрын
🎉🎉 Amazing
@gkcs2 ай бұрын
Cheers!
@BratinJ3 күн бұрын
I think AWS Kendra does the same. It’s not cheap though.
@TotalTechnologyZonne2 ай бұрын
Small doubt ,At the beginning, you mentioned using ChatGPT to store transcripts in the vector database. I believe you meant to say that you used a text embedding model from OpenAI to generate and store vector embeddings.
@gkcs2 ай бұрын
That's right. We got the embeddings with OpenAI: platform.openai.com/docs/api-reference/embeddings
@TheGsinghg2 ай бұрын
No. He stores the transcripts. He uses the "Retrieval" part, i.e. getting similar video files using Neon and tells the GPT model via API to consider files with index [i,j,k ...] to be used for augmentation for answer generation. I intentionally broke down the RAG across the sentence so that you know it is not something fancy.
@phanikumar278Ай бұрын
Nice video , i always treat you like peer having discussion in room , you are good techie . One question though, how will you protect the privacy or content of the file that you sent to chatgpt ? will they use that for training or is it very locale to you when you store it ( I assume it will be for your user account )
@gkcsАй бұрын
I think they'll end up using it for training. I have to read their TnC page to be sure. It is local to my account (I am paying for their API access, so it shouldn't be shared anywhere else). Thanks for the comment 😁
@gkcsАй бұрын
@@Parul-u6e Thanks for letting me know :)
@rameshpraveen24882 ай бұрын
Hi, so am I correct in understanding that after feeding ChatGPT all your transcript files, the queries with no context didn’t yield good response; then with context pointing to a specific file yielded better response; and with context to multiple related files it yielded even better response?
@gkcs2 ай бұрын
That's right 😁
@sarojpatra83062 ай бұрын
Very nicely explained. Just right amount of information.
@gkcs2 ай бұрын
Thank you!
@bahubalichanduwad2 ай бұрын
Are you suggesting we pass all the matching files for getting queries answered? Would it not be very expensive?
@peeyushagrawal212 ай бұрын
great application :) can you please share the source code so that the extensions can be made out of it.
@Md_sadiq_Md2 ай бұрын
4:18 uff the humour
@gkcs2 ай бұрын
:p
@Neki_Works28 күн бұрын
This is a very good 👍 but I think that ChatGPT or other LLMs can answer the questions directly without storing data in them. I think you tried mentioning this in the beginning of the video and I'll like to know why it didn't work directly
@jitensi3 күн бұрын
@@Neki_Works Ij this case it might but when you are developing projects in organizations where the data is not public , the models wouldn’t know the answer to those questions. You can then use org specific data as embedding a to answer your questions. RAG systems are most useful inside organisations
@epistemophilia-32 ай бұрын
When a question is asked, it needs to be sent to Neon DB, where similar context transcripts are retrieved from the database. These transcripts are then sent to OpenAI, and the response is returned and displayed to the user. I want to know if this process happens instantly, or if I am comprehending it incorrectly.
@gkcs2 ай бұрын
It doesn't happen instantly, it takes about 6 seconds for OpenAI to respond. We are working on showing a loader while this happens, so the UX is good.
@vam8775Ай бұрын
I got this video after commenting on a post
@BloggerVikash2 ай бұрын
I am using digital ocean droplet to host Postgres, can i use Neon? how do you rate Neon??
@gkcs2 ай бұрын
I found Neon fast and good.
@BloggerVikash2 ай бұрын
@ i want overall rating sir Like Can we trust for production ready app, which may cater lakh around user?
@PrabhakarKumar972 ай бұрын
@@BloggerVikash Read the fine manual
@BloggerVikash2 ай бұрын
@@PrabhakarKumar97 yes i have gone through Currently avoiding neon Because of the region. Indian region is not available
@jogendragurjar89958 күн бұрын
Why you have not used Amazon Aurora for interview-ready database?
@sounishnath5132 ай бұрын
I have one query, for caching what are you essentially caching? Is it only the semantic query relevance if multiple users asks similar query? Would appreciate if you throw some light off
@sarvesharora9503Ай бұрын
I guess I didn't understand the problem well. Can't Gpt goes through all of your files (it has already too much data which it's trained on why can't go through some more) ?
@algoforyou23 күн бұрын
To store records in a vector database, which model did use? I assume in the vector database, you are storing embedding vectors for each of the video recordings. is that right?
@m.f.mfazrin87207 күн бұрын
I think you can do it with Notebook LLM easily
@RakeshRout-x8w4 күн бұрын
how to make it history context aware multi user bot?kindly cover that
@logeshkumar83332 ай бұрын
Sorry, May i kindly know what steps are involved in augmentation (a in rag)
@savrah6 күн бұрын
Is this same as customgpt knowledge base file ? I mean when would you use customgpt knowledge base and vectors?
@akexop74042 ай бұрын
Can we utilised not diamond ai tool which provides free apis for different ai models, so does it works the same?
@experimentscraze3 күн бұрын
Suppose I am looking for using this system you described to build sql queries, in such case I need to upload the DDL of the tables as files in neon and same DDLs in ChatGPT as files. In such case can I get query from gpt generated with prompt I give in natural language for eg. Give me top 10 customers. Will this be giving me correct query.
@simkeyur7Ай бұрын
Depending on your salary got me 🤣🤣🤣
@mohitisimmortal2 ай бұрын
bro, how to make embeddings in vector database, like use openai embeddings api for that or neon did that?
@gkcs2 ай бұрын
You can use openAI embeddings, yes. You can also use other open source embedding algorithms. We used OpenAI because it's something we have heard of :p
@12920332 ай бұрын
would you be able to open source the entire solution? Like a git repo?
@PrajwalMSPАй бұрын
Did you use Langchain?
@icepreyasАй бұрын
@gkcs would the outcome of this be the same as creating a custom GPT in OpenAI using your transcript files and calling that GPT via the API?
@shubhamsonarikar88772 ай бұрын
While working with AWS transcribtion for audio queries its very slow and time consuming is their any way to increase performance for this process specially using audio .mp3 file from s3 bucket
@gkcs2 ай бұрын
Adobe takes time too and so does Vimeo. I found AWS easier since it's an API and runs async.
@wasitechlearningАй бұрын
May I get your AI Teacher demo? How it actually looks like.
@JUNGLESAFARI-INDIA24 күн бұрын
Files that are uploaded to chatgpt, will get uswd for training chatgpt?
@W_G-yb6ngАй бұрын
Are you using any Guardrails?
@vamsee825 күн бұрын
I somehow feel vector database here is an overkill.. why cant a recommendation search work here with content filtering? may be you could have directly used elastic search for getting relevant recommendations for the topic and then pass it on to Open AI to get appropriate answers... or even graph database would suffice
@harshawardhankamble49672 ай бұрын
Woke up to this
@gkcs2 ай бұрын
Good morning!
@gauthamvijayan2 ай бұрын
💖
@The_Daily_chessАй бұрын
Hey can anyone tell me about approx cost of this project or a working prototype?
@dhruvmehta41812 ай бұрын
How do you check whether your system is not used other than asking questions related to your videos only as Amazon also got this wrong in its first try where it written python code for the user
@navin346Ай бұрын
What does load balancing mean?
@navin346Ай бұрын
Brilliant video btw. Just get a hang on the stuff that am looking for 🤝
@gkcsАй бұрын
Hahaha!
@puneetarora17142 ай бұрын
like marvel, the post credits 🤣🤣🤣
@gkcs2 ай бұрын
:D
@SreeRamakrishnaNagulakondaАй бұрын
This an use case and real implementation of an AI agent correct? @gkcs
@Hercules1592 ай бұрын
pg vector = PostgreSQL ?
@gkcs2 ай бұрын
@@Hercules159 Yes it's a vector database extension with the Postgres DB.
@Hercules1592 ай бұрын
@gkcs thx a lot
@shankarnarayanan17322 ай бұрын
Osm
@gkcs2 ай бұрын
Thank you!
@RR-zg5rd2 ай бұрын
hahaha ! that laugh for 100$ was epic
@gkcs2 ай бұрын
Bootstrapped budget constraints :P
@codeblood872 ай бұрын
Waooo
@gkcs2 ай бұрын
Thank you!
@mdmishfaqahmed21382 ай бұрын
PG-vector 💀
@hypergaming3682Ай бұрын
Why this '💀' though
@abhi_9724 күн бұрын
PGVector scale 💀☠️ >>
@mdmishfaqahmed213824 күн бұрын
@@hypergaming3682 parental guidance required.. this vector aint for toddlers
@hypergaming368224 күн бұрын
@@mdmishfaqahmed2138 open to learn something new from the masters