Advancing AI - Databricks Vector Search Index

  Рет қаралды 2,843

Advancing Analytics

Advancing Analytics

4 ай бұрын

The RAG (Retrieval Augmented Generation) pattern for keeping LLM's honest and accurate is super popular and being widely adopted, but you generally need to set up embeddings inside a Vector Database to get it working. Databricks recently released the Vector Search Index to automate this process for you, taking an existing Delta table and managing an underlying Vector Store!
In this video, Simon & Gavi look at the new Vector Search Index (VSI) functionality within Databricks, the limitations with the preview and the steps to get started working with it. Building a GenAI App in Databricks? This is your first step.
For more info on Databricks VSI, check out the docs here: docs.databricks.com/en/genera...
As always - if you're embarking on a GenAI application, get in touch with AA to give you a boost ahead!

Пікірлер: 2
@christianw3858
@christianw3858 3 ай бұрын
I see the benefit of the automated index update once you are pushing something to the table and not having to chunk that first. On the other hand, I see there a big disadvantage if you are applying a RAG architecture and feeding e.g. Azure OpenAI with it, as it returns the whole document rather than just the relevant chunks based on the vector search which can exceed tokens quite easily. I still would chunk it beforehand, do the embedding and udate the table. Do you know if this is possible and I would like to get the opinion on my statement here as well!
@user-cz7yr9hs6g
@user-cz7yr9hs6g 4 ай бұрын
@user-cz7yr9hs6g 0 seconds ago Hello Simon. I have been a fan of your videos for a while. I wonder if you could answer this quick question. If we were to pursue using Databricks for an end to end RAG implementation , what is a good pattern that you may have seen, integrating Databricks with some sort of UI? Streamlit, is one the suggestions that I saw. But ideally, would love to have something that could be not a custom build.
Advancing AI - Retrieval Augmented Generation
26:04
Advancing Analytics
Рет қаралды 1,9 М.
Advancing Spark - The Data Intelligence Platform
18:36
Advancing Analytics
Рет қаралды 4,2 М.
Each found a feeling.#Short #Officer Rabbit #angel
00:17
兔子警官
Рет қаралды 8 МЛН
World’s Deadliest Obstacle Course!
28:25
MrBeast
Рет қаралды 78 МЛН
Sprinting with More and More Money
00:29
MrBeast
Рет қаралды 182 МЛН
Hot Ball ASMR #asmr #asmrsounds #satisfying #relaxing #satisfyingvideo
00:19
Oddly Satisfying
Рет қаралды 47 МЛН
Behind the Hype: Is Data a Product?
22:33
Advancing Analytics
Рет қаралды 3,6 М.
Delta Lake Deep Dive: Liquid Clustering
40:54
Delta Lake
Рет қаралды 3,8 М.
The Power of Vector Databases For Knowledge Search
20:11
Code to the Moon
Рет қаралды 42 М.
Advancing Spark - Bloom Filter Indexes in Databricks Delta
24:41
Advancing Analytics
Рет қаралды 8 М.
The Gen AI Payoff in 2024: Introduction with Naveen Rao
16:36
Databricks
Рет қаралды 2,5 М.
Advancing Spark - Databricks In-Browser Interactive Debugger
14:49
Advancing Analytics
Рет қаралды 2,2 М.
Embeddings: What they are and why they matter
38:38
Simon Willison
Рет қаралды 20 М.
Each found a feeling.#Short #Officer Rabbit #angel
00:17
兔子警官
Рет қаралды 8 МЛН