No video

Deep Learning with PolyAI: The Multilayered Anatomy of an AI Voice Assistant

  Рет қаралды 39,322

PolyAI

PolyAI

Күн бұрын

In this episode of 'Deep Learning with PolyAI,' we welcome Shawn Wen, co-founder and CTO at PolyAI. Shawn provides an in-depth overview of the AI tech stack essential for developing high-quality AI voice assistants. Inspired by Andreessen Horowitz's recent publication on AI voice agents, the discussion covers key components of a complex system, including speech recognition, voice activity detection, the application of generative AI models, and the integration of these technologies into practical applications. Shawn also explores the challenges of managing latency, how input affects selected speech recognition models, and the future of end-to-end AI systems. Join us as we unravel the complexities behind creating and optimizing effective voice AI solutions!
00:18 Understanding the AI Tech Stack
00:49 Building and Buying Voice Assistants
02:10 Speech Recognition Challenges
07:29 Voice Activity Detection (VAD)
10:31 Generative AI and Guardrails
15:58 Tooling and Function Calls
22:39 Future of End-to-End Models
#ai #voiceai #texttospeech #asr #aitechnology #deeplearning

Пікірлер
Unreasonably Effective AI with Demis Hassabis
52:00
Google DeepMind
Рет қаралды 156 М.
АЗАРТНИК 4 |СЕЗОН 2 Серия
31:45
Inter Production
Рет қаралды 565 М.
When you discover a family secret
00:59
im_siowei
Рет қаралды 32 МЛН
مسبح السرير #قصير
00:19
سكتشات وحركات
Рет қаралды 11 МЛН
a day in the life of an engineer working from home
8:42
Joma Tech
Рет қаралды 20 МЛН
What are AI Agents?
12:29
IBM Technology
Рет қаралды 303 М.
OK. Now I'm Scared... AI Better Than Reality!
8:10
AI Revolution
Рет қаралды 165 М.
These Illusions Fool Almost Everyone
24:55
Veritasium
Рет қаралды 3,4 МЛН
6 Years of Studying Machine Learning in 26 Minutes
26:05
Boris Meinardus
Рет қаралды 86 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
AI Realism Breakthrough & More AI Use Cases
25:52
The AI Advantage
Рет қаралды 115 М.
How to use ChatGPT to learn ANY Language (new update)
13:26
Matt Brooks-Green
Рет қаралды 38 М.