Using Vector Databases for Multimodal Embeddings and Search - Zain Hasan - NDC London 2024

NDC Conferences

This talk was recorded at NDC London in London, England.
Many real-world problems are inherently multimodal, from the communicative modalities humans use, such as spoken language and gestures, to the force, proprioception, and visual sensors ubiquitous in robotics. For machine learning models to address these problems, interact more naturally and holistically with the world around them, and ultimately become more general and powerful reasoning engines, they need to understand data across all of its corresponding image, video, text, audio, and tactile representations.
In this talk, Zain Hasan will discuss how we can use open-source multimodal models (such as github.com/facebookresearch/I...) that can see, hear, read, and feel data(!) to perform cross-modal search (searching audio with images, videos with text, etc.) at the billion-object scale with the help of open-source vector databases. He will also demonstrate, with live code demos and large-scale datasets, how performing this cross-modal retrieval in real time can help users add natural search interfaces to their apps. The talk will revolve around how we scaled the usage of multimodal embedding models in production and how you can add cross-modal search to your apps.
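The idea behind cross-modal search can be sketched in a few lines: once a multimodal model maps images, audio, and text into one shared vector space, searching across modalities reduces to a nearest-neighbour lookup. The sketch below is not the code from the talk. The `Item` and `ToyVectorIndex` names are hypothetical, the three-dimensional vectors are hand-made stand-ins for real model embeddings, and the brute-force cosine scan stands in for the approximate nearest-neighbour index a vector database would use at the billion-object scale.

```python
import math
from dataclasses import dataclass


@dataclass
class Item:
    item_id: str
    modality: str        # e.g. "image", "audio", "text"
    vector: list         # embedding in the shared space (toy values here)


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyVectorIndex:
    """Hypothetical in-memory index; a brute-force scan, not a real vector DB."""

    def __init__(self):
        self._items = []

    def add(self, item):
        self._items.append(item)

    def search(self, query_vector, k=3, modality=None):
        # Optionally restrict results to one modality, then rank by similarity.
        candidates = [
            it for it in self._items
            if modality is None or it.modality == modality
        ]
        scored = [(cosine(query_vector, it.vector), it) for it in candidates]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


index = ToyVectorIndex()
# Assumed embeddings: in practice these would come from a multimodal model.
index.add(Item("dog.jpg", "image", [0.9, 0.1, 0.0]))
index.add(Item("bark.wav", "audio", [0.8, 0.2, 0.1]))
index.add(Item("car.jpg", "image", [0.0, 0.1, 0.9]))
index.add(Item("engine.wav", "audio", [0.1, 0.0, 0.95]))

# Assumed embedding of the text query "a dog barking".
text_query = [0.85, 0.15, 0.05]

audio_hits = index.search(text_query, k=1, modality="audio")
image_hits = index.search(text_query, k=1, modality="image")
print(audio_hits[0][1].item_id, image_hits[0][1].item_id)
```

Because every modality lives in the same space, the same text query vector retrieves both the closest audio clip and the closest image; only the modality filter changes.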
