End To End LLM Langchain Project using Pinecone Vector Database

  Рет қаралды 51,745

Krish Naik

Krish Naik

Күн бұрын

In this video we will be creeating an end to end LLM Project with vector.Pinecone makes it easy to provide long-term memory for high-performance AI applications. It’s a managed, cloud-native vector database with a simple API and no infrastructure hassles. Pinecone serves fresh, filtered query results with low latency at the scale of billions of vectors. database
github: github.com/krishnaik06/Comple...
Timestamp
00:00:00 Introduction To LLM Project
00:01:56 Setting up LLM Project
00:06:18 LLM Project Architecture
00:09:15 LLM Project Read the Data
00:16:05 Convert Data Into chunk And create vectors
00:21:40 Creating,Initializing,Inserting In Vectordb
00:27:58 Retrieving Results From VectorDB Similarity Search
----------------------------------------------------------------------------------------------------
Start Contributing in Open Source Projects
The-Grand-Complete-Data-Science-Materials
github.com/krishnaik06/The-Gr...
------------------------------------------------------------------------------------------------------
Complete Langchain Playlist:
• Amazing Langchain Seri...
--------------------------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
------------------------------------------------------------------------------------------------------------------------------
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
►Detailed Playlist:
Python Detailed Playlist: • Complete Road Map To B...
Python playlit in Hindi: • Tutorial 1- Python Ove...
Stats For Data Science In Hindi : • Starter Roadmap For Le...
Machine Learning In English : • Complete Road Map To B...
Machine Learning In Hindi : • Introduction To Machin...
Complete Deep Learning: • Why Deep Learning Is B...

Пікірлер: 129
@krishnaik06
@krishnaik06 7 ай бұрын
Check out other Langchain Projects below kzbin.info/www/bejne/aoCUo6ZtoteBetE&pp=gAQBiAQB
@saikatdas4654
@saikatdas4654 7 ай бұрын
Sir U r amazing. Sir, if possible then make videos on poison attacks on text data like trigger attacks or backdoor attacks.
@Nothing_but_NavEEEn_Official
@Nothing_but_NavEEEn_Official 7 ай бұрын
First like and Comment So Grateful for your guidance and contribution to the open-source community at the very need of Hour @KrishNaik Sir 🙏🏻❤️ Love from Bangalore ✨
@vivekanand1122
@vivekanand1122 7 ай бұрын
Amaizing Knowledge - Thanks Krish for your guidance !!
@dastajan6789
@dastajan6789 7 ай бұрын
Informative video like ever
@deekshad4774
@deekshad4774 3 ай бұрын
Thank you for such an amazing video!
@prasadagalave9762
@prasadagalave9762 7 ай бұрын
Thank-you dada ❤
@nooraftab6101
@nooraftab6101 4 ай бұрын
@krishnaik06: Thanks for the valuable resources. Couple of changes as I was running the code: 1) the model davinici-002 is deprecated. Instead OpenAI advises to use 'gpt-3.5-turbo-instruct'. 2) Pinecone initialization is also changed. For anyone following and coding from here, it was two breaking changes for me.
@rishabhranjan7687
@rishabhranjan7687 3 ай бұрын
Hey there, I'm stuck at pinecode initialization. Can you please tell me how you proceeded?
@darrendacosta6341
@darrendacosta6341 3 ай бұрын
@@rishabhranjan7687 hey have you been able to get it done?? if yes plz help .
@karangupta_DE
@karangupta_DE 4 күн бұрын
​@@rishabhranjan7687 the code in the video works with pinecode-client==2.2.1 But if you have 4.1.1, just search for langchain pinecode documentation and follow it step by step, it will resolve the issue.
@krishj8011
@krishj8011 11 күн бұрын
Amazing Tutorial...
@linuxmanju
@linuxmanju 6 ай бұрын
19:12 you need to return doc from the function not docs ( which is the raw input before chunks )?. Nice video and tutorials, thanks
@zainulabideen_1
@zainulabideen_1 Ай бұрын
Yes I also noticed same thing
@umeshnath8818
@umeshnath8818 7 ай бұрын
Awesome bro!
@inayatph
@inayatph 2 ай бұрын
thanku sir
@60pluscrazy
@60pluscrazy 7 ай бұрын
Excellent Krish 🙏 🙏 🎉🎉
@sangeethag1966
@sangeethag1966 5 ай бұрын
def chunk_data(docs,chunk_size=800,chunk_overlap=50): text_splitter=RecursiveCharacterTextSplitter(chunk_size=chunk_size,chunk_overlap=chunk_overlap) doc=text_splitter.split_documents(docs) return docs In this part you are not using chunked_data ..need to return - return doc...Is it right?
@machinelearning3518
@machinelearning3518 5 ай бұрын
yes have you checked len(doc)
@krishnaprasadsheshadri6206
@krishnaprasadsheshadri6206 5 ай бұрын
The similarity_search function is not defined? Any ideas how to define it cause I dont get how it works without the function being defined
@MansiKale-mr5ng
@MansiKale-mr5ng Ай бұрын
This is outstanding please share me some videos related to this Build one Q&A bot using open Al and Vector DB provide a couple of docs which should be stored in a vector db and when the questions asked to LLM model the LLM model should retrieve the answer from the vectors stored
@ShubhajitDutta-ve8bb
@ShubhajitDutta-ve8bb 6 ай бұрын
Please consider to build an application based on this LLM model
@quengelbeard
@quengelbeard 3 ай бұрын
Hey Krish, great video! Do you know if it's possible to automatically create a pinecone db index from code? So that you don't have to create them manually
@user-kx5ll7gj3u
@user-kx5ll7gj3u 5 ай бұрын
Hi Krish , Bigg thanks for all your contributions, I just have a doubt How to retrieve all docs that has been injected to elastic db?
@user-ho6iw4cg1c
@user-ho6iw4cg1c 3 ай бұрын
@krishnaik06, can we do the same with csv's having some kind of predictions from another ML model? Also, can we do some kind of weather report generation from IMD reports - I did try this, but it was not responding accurately. How to ensure the precision?
@tamilselvan3525
@tamilselvan3525 3 ай бұрын
26:28. We should give the chunks as an input to the function 'from_documents' but not 'doc'
@RaghavSoni-qv5bd
@RaghavSoni-qv5bd 3 ай бұрын
AttributeError: type object 'Pinecone' has no attribute 'from_documents' how can i resolve this error
@deepaliparochey2285
@deepaliparochey2285 2 ай бұрын
@@RaghavSoni-qv5bd you got the solution ??
@RaghavSoni-qv5bd
@RaghavSoni-qv5bd 2 ай бұрын
@@deepaliparochey2285 yes I got the solution of it
@user5398-v7x
@user5398-v7x 16 күн бұрын
Hey Krish that was a great video indeed just to let you know some of the functions related to PineCone Vector db are deprecated had to do a bit of googling from my side to connect to the database.
@autobase16
@autobase16 7 ай бұрын
🎉🎉🎉🎉
@soumyaranjansahu4262
@soumyaranjansahu4262 7 ай бұрын
Hey Krish, Could you please create a video on the setilup of pgvector
@SameerAli-nm8xn
@SameerAli-nm8xn 7 ай бұрын
Sir please try to apply dockers with it, because it will be really helpful.
@dhruvsanan4738
@dhruvsanan4738 6 ай бұрын
Hey!! Would you like to collaborate to make a llm app? I've already done this tutorial but I am thinking of making a full fledged app.
@tanmayjagtap78
@tanmayjagtap78 4 ай бұрын
Can anyone explain what is the difference between FAISS and Pinecone? In multiple PDF video he used FAISS but here Pinecone. Is there any specific way to use them?
@Stacks-ce2k
@Stacks-ce2k 5 ай бұрын
Sir I have a error of NoneType object is not callable in retrieve query vectore.similarity search line how to overcome with it
@user-ho6iw4cg1c
@user-ho6iw4cg1c 3 ай бұрын
Do we have to create vector db all time? Cannot we store the data permanently?
@anjalisingh1588
@anjalisingh1588 5 ай бұрын
I have one query - what if I want to connect my local database to a vector database and from that local database, the user can ask the question from the table !! Is this possible ?? if possible then how ??
@rishiraj2548
@rishiraj2548 7 ай бұрын
👍🙏
@GunjanGrunge
@GunjanGrunge 5 ай бұрын
Sir, lets say I want to deploy this chatbot trained on one pdf which has 2000 pages, can I remove the docuemnt part and only query from the db so that the load time and the embedding cost can be minimzed in production level chat bot, do let me know if you dint get the question, I will try to explain it in a better way.
@user-oz9fd8oc9j
@user-oz9fd8oc9j 7 ай бұрын
What is difference between loader vs parser in langchain?
@niravkhanpara5039
@niravkhanpara5039 6 ай бұрын
How its able to extract the text from the PDF? Is it using Tesseract or something else?
@user-me1ry6lg6d
@user-me1ry6lg6d 2 ай бұрын
how can i make an LLM model produce output that can be fomatted in the browser for display , like with html tags in it , is that possibel ?
@AIdevel
@AIdevel 5 ай бұрын
If I have many files in the same directory how to upload them ? thank you for your amazing vidoes
@ashokreddyz
@ashokreddyz 4 ай бұрын
Openai apikey which we nned to use free tier or subscription, I'm using pro version is 4 apikey error is unauthorized
@muaadhnazly7022
@muaadhnazly7022 Ай бұрын
Do I need to have a paid account of Open AI to use the OpenAI Embeddings? Thank You
@ShivamPatel-yg3kd
@ShivamPatel-yg3kd 7 ай бұрын
Actually inside chunk_data you are returning the wrong variable
@oliverkaunds
@oliverkaunds 5 ай бұрын
I did not understand one thing. Does the Vector DB talk to the LLM? We store data from PDF in Vector DB to make it searchable fast. But where is the part where we query Vector DB and send to LLM I could not see any LLM call may be I missed?
@shivanisingh8980
@shivanisingh8980 3 ай бұрын
Same question
@shivanisingh8980
@shivanisingh8980 3 ай бұрын
I think we use llm as open embedding model...
@mitejmadan8672
@mitejmadan8672 7 ай бұрын
Sir i dont have a credit card. Is it possible to create a project without using the openai api and just by open sourced models from huggingface and langchain. I am quite new in this llm field.
@RISHABH-VERMA
@RISHABH-VERMA 4 ай бұрын
Use Gemini-Pro's API instead of OpenAI.
@lakshmims7590
@lakshmims7590 3 ай бұрын
How run this directly on visual studio by git hub repository
@studytips7034
@studytips7034 3 ай бұрын
can you please solve this error AttributeError: type object 'Pinecone' has no attribute 'from_documents
@AkshayKumar-nh4fv
@AkshayKumar-nh4fv 3 ай бұрын
same error i'm also getting, it seems like they have removed this feature. Now only server less services can create index
@abhi9029
@abhi9029 7 ай бұрын
I am stuck on "You exceeded your current quota, please check your plan and billing details". Anyone help !
@mailsuresh9
@mailsuresh9 6 ай бұрын
I am stuck on the same @krish Naik. Please help
@mailsuresh9
@mailsuresh9 6 ай бұрын
Actually, added $5 to open ai and waited for 20 mins and tried and it worked. Just created a new API key for clarity.
@narnavar
@narnavar 5 ай бұрын
If you are using old account for accessing chat gpt ..create new account and key. 5 dollars can be used only till 3 months from the date of opening
@sefercanapaydn9347
@sefercanapaydn9347 5 ай бұрын
in chunk_data function you returned "docs" instead of "doc" isn't it wrong?
@shivanisingh8980
@shivanisingh8980 3 ай бұрын
Below you can see docs=doc
@lokeshsharma668
@lokeshsharma668 Ай бұрын
hi krish, when i run this code i am getting error like Pinecone has no attribute from_documents please help me to resolve this issue
@zayanahmad5706
@zayanahmad5706 5 ай бұрын
Hi! Guyz when I run the code I got some errors. Please help me
@SurajKumar-uf6it
@SurajKumar-uf6it Ай бұрын
Sir can we do this same for csv files. Please answer this question
@thomasferreira306
@thomasferreira306 5 ай бұрын
Hi, just want to tell you thay i created a similar project 8 month ago, everything was working but pinecone has migrate to a new version now "init" is no longer a top-level attribute of the pinecone package. I've made lot of reasearch but i don't find any solution to make it working again did you have a idea ?
@pranavbajpai5702
@pranavbajpai5702 5 ай бұрын
Yes needed help on this if anyone has a solution to this
@hamza_mehmood
@hamza_mehmood Ай бұрын
@krishnaik06 sir please can i have the link of that video which you are talking about at the end of this video??
@vaishnokmr
@vaishnokmr 7 ай бұрын
Disclaimer: if you dont have pinecone api, you can use chromaDB, Sqlite or multiple db but you have to knowledge about schema and tables.
@dhruvsanan4738
@dhruvsanan4738 6 ай бұрын
Hey!! Would you like to collaborate to make a llm app? I've already done this tutorial but I am thinking of making a full fledged app.
@stefano94103
@stefano94103 7 ай бұрын
Unfortunately 90% of the developer videos are useless because they only build the backend. This is useless to show your manager or a client. Videos that show how to connect a backend to the users front end would be 1,000 times more helpful
@gudiatoka
@gudiatoka 7 ай бұрын
Right 👍
@ARkhan-xw8ud
@ARkhan-xw8ud 7 ай бұрын
You can check code with harry he has made with frontend
@saurabhsinghjat
@saurabhsinghjat 7 ай бұрын
Bhai hr cheez thali me paros ke nhi di jaati. Fr kaahe ke developer. Thoda research kro sb mil jaayega. Is information age me bhi specific information ke liye ro rhe h hm to bantadhar h.
@shivanisingh8980
@shivanisingh8980 4 ай бұрын
​@@ARkhan-xw8udcan you plz share the link
@Vasu-Katravath
@Vasu-Katravath 2 ай бұрын
Then you build the backend, Krish Naik will try to make the front end videos if you feel backend is easier.
@hetdave8379
@hetdave8379 5 ай бұрын
I don't want to use model API key. Is there any way to integrate LLM model which is stored in local machine and query the SQL database?
@krishnaik06
@krishnaik06 5 ай бұрын
Use llama2 i have already created a video
@user-yk7qb4ib3e
@user-yk7qb4ib3e 7 ай бұрын
hi sir in gihub source code of test.ipynb not avialabel with full code it has only import libirary
@krishnaik06
@krishnaik06 7 ай бұрын
Updated the code
@programwithpradhan
@programwithpradhan 7 ай бұрын
Is there any open source embeddings we can use instead of Pinecone
@manasviemmadi8072
@manasviemmadi8072 Ай бұрын
pinecone is vector database that stores the embeddings, it doesn't offer embeddings itself. there are many open source embeddings online. you can check the huggingface MTEB leaderboard for embedding models.
@TheLeads
@TheLeads 2 ай бұрын
Error: init is no longer a top-level attribute of the pinecone package.
@m.rr.c.1570
@m.rr.c.1570 6 ай бұрын
Why mine is taking so much time to print the answers
@saransomasundharam2617
@saransomasundharam2617 4 ай бұрын
hello krish, I was tried this code facing some issue with Pinecone.from_documents. there is no library called from_documents in pinecone.
@dheerendrasinghbhadauria9798
@dheerendrasinghbhadauria9798 26 күн бұрын
I am also getting same error
@tharunps8048
@tharunps8048 7 ай бұрын
17:50 shouldn't we return "doc" instead of "docs" ?
@krishnaik06
@krishnaik06 7 ай бұрын
Yes i guess that is a simple mistake i did :).Gald you found it
@pk231
@pk231 7 ай бұрын
I too noticed and thats why the Chunks have not been created and the size remains 58 I guess. I am not sure though.
@rajvimultani871
@rajvimultani871 Ай бұрын
Can you pls provide next video link for UI part
@rahulpawar8594
@rahulpawar8594 Ай бұрын
@krishnaik06 : Thank for the detailed video. I need your help, I am trying to replicate the same with my list of documents. I am able to create vector DB, but when I am querying, something like, Give me the list of employee for which data is uploaded, I am not getting correct answer. Can you please help, how can we solve such type of scenario. your help is appreciated. Thank you
@abhisheksinghmahra446
@abhisheksinghmahra446 7 ай бұрын
i dont want to use openai api kindly make it with googlepalm api krish..i have to submit my college project tomorrow just let me know how can i do embedding with the help of palm api key i dont have openai subscription
@kiranshenvi2626
@kiranshenvi2626 3 ай бұрын
Hello Krish, Kindly request if you can share link to colab file if possible.
@amit2k5
@amit2k5 3 ай бұрын
The pinecone initialization step described in this video is not working. Any updated video or code please?
@deepaliparochey2285
@deepaliparochey2285 2 ай бұрын
did you get the solution ?
@shubhamkumar1305
@shubhamkumar1305 6 ай бұрын
How can we overcome that without using openai key?
@CodeVeda
@CodeVeda 4 ай бұрын
use gooogle's like .... GoogleGenerativeAI(model="models/text-bison-001",google_api_key=api_key, temperature=0.2) It works
@DebjyotiSaha-lh8lg
@DebjyotiSaha-lh8lg 4 ай бұрын
I am getting an error in the pinecone initiation process. Considering all the process you said I am getting error as pinecone has not attribute init. Can you please help me on how to proceed furhter
@syedmuzammilahmed6872
@syedmuzammilahmed6872 4 ай бұрын
Pinecone has now changed its framework. Some elements like "init," "from documents" etc. are now removed. So, the procedure used in this tutorial is now outdated.
@dijendersaini9056
@dijendersaini9056 4 ай бұрын
it should be Pinecone instead of pinecode in pinecode.init
@kyoli9847
@kyoli9847 4 ай бұрын
​@@syedmuzammilahmed6872 So how do we load a document object into pinecone now then? Since Pinecone.from_documents(docs, embeddings) no longer work
@deepaliparochey2285
@deepaliparochey2285 2 ай бұрын
@@kyoli9847 Anyone found the solution for this ?
@piyush_nimbokar_07
@piyush_nimbokar_07 2 ай бұрын
Pinecone instance is not being created and how to do in newer version in v3.2.2 please tell
@deepaliparochey2285
@deepaliparochey2285 2 ай бұрын
Did you get the solution ?? if yes please help
@piyush_nimbokar_07
@piyush_nimbokar_07 2 ай бұрын
@@deepaliparochey2285 use older version of langchian and pinecone it will work
@RaghavSoni-qv5bd
@RaghavSoni-qv5bd 3 ай бұрын
AttributeError: type object 'Pinecone' has no attribute 'from_documents'
@shinycaroline3722
@shinycaroline3722 3 ай бұрын
Yeah same issue, something wrong with langchain and pinecone once they have upgraded the version
@VikashKumar-ty6uy
@VikashKumar-ty6uy 5 ай бұрын
Getting this error while using pinecone vector db, can someone plz help me to resolve this pinecone.init( api_key="a4e02d-d1db-4d66-aaf9-fec1a868", environment="gcp-starter" ) index_name="langchainvector" AttributeError Traceback (most recent call last) Cell In[33], line 2 1 ## Vector Search DB In Pinecone ----> 2 pinecone.init( 3 api_key="a4e0266d-d1db-4d66-aaf9-fe1be7c1a868", 4 environment="gcp-starter" 5 ) 6 index_name="langchainvector" File e:\Gen AI Project\Langchain_using_vectorDB\vectordbvenv\lib\site-packages\pinecone\deprecation_warnings.py:38, in init(*args, **kwargs) AttributeError: init is no longer a top-level attribute of the pinecone package. Please create an instance of the Pinecone class instead.
@harshadapatke885
@harshadapatke885 4 ай бұрын
Pinecone has removed init() in latest update. please use pinecone-client==2.2.4
@shubhamkumar1305
@shubhamkumar1305 6 ай бұрын
Have you anyone give RateLimitError , 429 error?
@mailsuresh9
@mailsuresh9 6 ай бұрын
Yup. Stuck on the same on open AI 429 error.
@mailsuresh9
@mailsuresh9 6 ай бұрын
Actually, added $5 to open ai and waited for 20 mins and tried and it worked. Just created a new API key for clarity.
@parantikaghosh1182
@parantikaghosh1182 2 ай бұрын
pinecone is modified, this code does not work now, needs to be updated
@nitinsiwach1989
@nitinsiwach1989 2 ай бұрын
This video is outdated. The pinecone API has changed significantly and this tutorial wont work
@ayushmishra5861
@ayushmishra5861 2 ай бұрын
Did you find the updated solution?
@gowthamdora6146
@gowthamdora6146 7 ай бұрын
You need to return doc in chunk function but you are returning docs which is same as original documents,so you are not giving the text chunks for embeddings but whole documents.
@krishnaik06
@krishnaik06 7 ай бұрын
Ha bhai its a minor mistake u can update the code :)
@mohammadriyaz5586
@mohammadriyaz5586 2 ай бұрын
Please use free api's
@siddappakurabar9015
@siddappakurabar9015 7 ай бұрын
Sir how to get openai key sir
@user-hs5ty7gz7j
@user-hs5ty7gz7j 4 ай бұрын
Can someone please help me, PyPDFDirectoryLoader is not working, Below is the error, tried my best to resolve not happening Traceback (most recent call last): File "C:\LLMAppPinecone\test.py", line 4, in from langchain_community.document_loaders.pebblo import PebbloSafeLoader File "C:\Users\jchristian2\AppData\Local\anaconda3\Lib\site-packages\langchain_community\document_loaders\__init__.py", line 163, in from langchain_community.document_loaders.pebblo import PebbloSafeLoader File "C:\Users\jchristian2\AppData\Local\anaconda3\Lib\site-packages\langchain_community\document_loaders\pebblo.py", line 5, in import pwd ModuleNotFoundError: No module named 'pwd'
Why are vector databases so FAST?
44:59
Underfitted
Рет қаралды 14 М.
когда повзрослела // EVA mash
00:40
EVA mash
Рет қаралды 3,9 МЛН
OMG🤪 #tiktok #shorts #potapova_blog
00:50
Potapova_blog
Рет қаралды 18 МЛН
Became invisible for one day!  #funny #wednesday #memes
00:25
Watch Me
Рет қаралды 49 МЛН
ИРИНА КАЙРАТОВНА - АЙДАХАР (БЕКА) [MV]
02:51
ГОСТ ENTERTAINMENT
Рет қаралды 12 МЛН
Things Required To Master Generative AI- A Must Skill In 2024
15:01
OpenAI Embeddings and Vector Databases Crash Course
18:41
Adrian Twarog
Рет қаралды 413 М.
How I built my best ML project without going crazy
14:25
Boris Meinardus
Рет қаралды 12 М.
Vector Database Explained | What is Vector Database?
6:52
codebasics
Рет қаралды 68 М.
когда повзрослела // EVA mash
00:40
EVA mash
Рет қаралды 3,9 МЛН