Master RAG on Vertex AI with Vector Search and Gemini Pro

  Рет қаралды 4,944

Janakiram MSV

Janakiram MSV

Күн бұрын

Are you ready to take your question-answering systems to the next level? In this tutorial, we'll dive into integrating Retrieval Augmented Generation (RAG) with Google Cloud's Vertex AI Vector Search and the powerful Gemini language model.
You can access the complete code at gist.github.com/janakiramm/55... (Vector Search)
and
gist.github.com/janakiramm/7d... (RAG)
What you'll learn:
Understanding RAG: How RAG combines retrieval and generative techniques for superior question answering.
Setting up Vertex AI Vector Search: Create and configure your vector search index for efficient document storage and retrieval.
Harnessing Gemini: Leverage Gemini's language capabilities to enhance RAG's answer generation.
Step-by-step Implementation: Follow along as we build a RAG-powered question-answering system on Vertex AI.
Tips and Best Practices: Get insights for optimizing your RAG implementation.
Chapters:
00:00 Introduction
00:54 Overview of RAG
07:05 Configuring and Deploying Vector Search Index Endpoint
18:10 RAG with Gemini
LinkedIn: / janakiramm
#RAG #QuestionAnswering #GoogleCloud #VertexAI #VectorSearch #Gemini #subscribe #genai #tutorial

Пікірлер: 23
@IanMcAleer-op1xj
@IanMcAleer-op1xj Ай бұрын
Thanks, this is tremendously helpful One point to note - you need to upload the embed file, not the sentence file -> upload_file(bucket_name,embed_file_path)
@thecopt11
@thecopt11 3 ай бұрын
Best tutorial. Big thanks for your shared.
@edubr2011
@edubr2011 3 ай бұрын
Excelent video! Thanks for sharing the code too.
@Janakirammsv
@Janakirammsv 3 ай бұрын
Glad it was helpful!
@ShahidGhetiwala-dg3ol
@ShahidGhetiwala-dg3ol 3 ай бұрын
Great Video, thank you soo much........
@arvindmathur6574
@arvindmathur6574 3 ай бұрын
Great!
@sureshkumarselvaraj8911
@sureshkumarselvaraj8911 3 ай бұрын
Great video! What is the difference between Vertex Search service VS Vector Search for RAG application? which one is better in terms of handling better retrieval of relevant documents for RAG application where we deal with 100+ PDF documents? Can you share some insights?
@AhmedBesbes
@AhmedBesbes 3 ай бұрын
Thanks for the tutorial! Instead of going through the ids in the json file to fetch the sentences, is it possible to integrate those directly as metadata in the index?
@wanderlust8367
@wanderlust8367 Ай бұрын
the code link u have shared is incomplete, load_file is missing and other few stuffs,
@Hitish99999
@Hitish99999 2 ай бұрын
Thanks for the tutorial. I am bit confused which file to be uploaded to bucket. sentence file or embedding file
@MarceloFerreira-rl6hh
@MarceloFerreira-rl6hh 2 ай бұрын
Great job! Thanks a lot. What’s the difference between this approach and using langchain?
@GAURAVRAUT007
@GAURAVRAUT007 2 ай бұрын
Excellent video - can u please do same with Langchain with retrieval
@TomFord-mv2mx
@TomFord-mv2mx 3 ай бұрын
Great Video. One question, I noticed you used a different model (gecko) to Gemini Pro for the embeddings. Is this ok to do? I assumed the models needed to be the same for both training and inference? Thanks again
@Janakirammsv
@Janakirammsv 3 ай бұрын
Text embedding models are independent of LLMs. You only have to ensure that the same embedding model is used for indexing the documents and the query. This is critical to retrieving the context based on the similarity.
@vikasbammidi1340
@vikasbammidi1340 2 ай бұрын
Can you please do a video on "How to use the same in Langchain with retrieval"
@GAURAVRAUT007
@GAURAVRAUT007 2 ай бұрын
+1
@tarunrey619
@tarunrey619 3 ай бұрын
Thanks for sharing knowledge. Can you share the notebook
@Janakirammsv
@Janakirammsv 3 ай бұрын
Please check the description. I have added the links.
@JulianHarris
@JulianHarris 3 ай бұрын
Nice. Are you ok to share the colab notebook?
@Janakirammsv
@Janakirammsv 3 ай бұрын
Yes, sure. Please check the description. I have added the links.
@user-mk3qb3iv2i
@user-mk3qb3iv2i 2 ай бұрын
why always python is there any way to use js?
@khondakersajid1138
@khondakersajid1138 3 ай бұрын
Possible to share the notebook?
@Janakirammsv
@Janakirammsv 3 ай бұрын
The code is available at gist.github.com/janakiramm/55d2d8ec5d14dd45c7e9127d81cdafcd and gist.github.com/janakiramm/7dd73e83c92a0de0c683ed27072cdde2
Using LangChain with Gemini and Chroma DB
23:48
Janakiram MSV
Рет қаралды 2,2 М.
Grounding for Gemini with Vertex AI Search and DIY RAG
35:31
Google Cloud Tech
Рет қаралды 9 М.
Sigma Girl Past #funny #sigma #viral
00:20
CRAZY GREAPA
Рет қаралды 33 МЛН
World’s Deadliest Obstacle Course!
28:25
MrBeast
Рет қаралды 158 МЛН
How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini
34:22
Google for Developers
Рет қаралды 38 М.
Vector Search and Embeddings
34:43
Google Cloud
Рет қаралды 6 М.
Tune and deploy Gemini with Vertex AI and ground with Cloud databases
38:33
Google Cloud Tech
Рет қаралды 3,2 М.
Gemini Multimodal RAG Applications with LangChain
59:36
Google Cloud Events
Рет қаралды 12 М.
Building transformative applications with Gemini on Google Cloud
59:19
Google Cloud Events
Рет қаралды 8 М.
RAG from the Ground Up with Python and Ollama
15:32
Decoder
Рет қаралды 25 М.
BigQuery vector search and embedding generation
10:08
Google Cloud Tech
Рет қаралды 8 М.
What is Retrieval-Augmented Generation (RAG)?
6:36
IBM Technology
Рет қаралды 589 М.
When you have 32GB RAM in your PC
0:12
Deadrig Gaming
Рет қаралды 1,4 МЛН
Как слушать музыку с помощью чека?
0:36