Twelve Labs: Multimodal AI That Understands Videos like Humans

  Рет қаралды 198

Jan Ozer

Jan Ozer

Күн бұрын

In this interview from NAB 2024 in Las Vegas, Anthony Giuliani of Twelve Labs describes his company's sophisticated approach to analyzing video content. This approach enables a human-like interpretation without relying on traditional metadata, and allows users to search, classify, and execute other tasks with videos by understanding various elements such as sound, speech, actions, and even visual cues like logos.
The implications are profound across many industries, including entertainment, sports, and security. By extracting metadata dynamically, the system enhances content discoverability and management, streamlining workflows and significantly improving the accuracy and relevance of search results within large video datasets.
Contents:
00:00:00 - Introduction to Twelve Labs and their video metadata extraction technology.
00:00:49 - Explanation of multimodal video understanding models that interpret videos similarly to human cognition.
00:01:40 - Discussion on the role of video embeddings in eliminating the need for traditional metadata while complementing it where available.
00:02:29 - Insights into the various modalities processed by their technology, including audio, speech, and visual cues.
00:03:26 - Overview of current users and applications, highlighting the involvement of enterprise customers like the NFL.
00:06:14 - Challenges in developing technology that provides a human-like understanding of video content.
00:11:47 - Potential future enhancements and the flexibility of video embeddings compared to traditional tagging methods.

Пікірлер
OMG🤪 #tiktok #shorts #potapova_blog
00:50
Potapova_blog
Рет қаралды 17 МЛН
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 102 М.
GraphRAG: LLM-Derived Knowledge Graphs for RAG
15:40
Alex Chao
Рет қаралды 77 М.
What's the future for generative AI? - The Turing Lectures with Mike Wooldridge
1:00:59
Multimodality and Data Fusion Techniques in Deep Learning
23:01
ISTA Conference
Рет қаралды 4 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Страшно, когда ругается мама😰
0:10
Лиза Вертинская
Рет қаралды 1,3 МЛН
1❤️ #shorts
0:17
Saito
Рет қаралды 31 МЛН
Пугает людей игрушкой аллигатора в воде
0:14
Короче, новости
Рет қаралды 2,7 МЛН
I want to play games. #doflamingo
0:20
OHIOBOSS SATOYU
Рет қаралды 15 МЛН
Khi em gái tôi đắp mặt nạ || Mask of joy #shorts
0:11
Linh Nhi Shorts
Рет қаралды 2,4 МЛН