Deep Learning with PolyAI: The Multilayered Anatomy of an AI Voice Assistant

No video

Deep Learning with PolyAI: The Multilayered Anatomy of an AI Voice Assistant

Рет қаралды 39,322

PolyAI

Күн бұрын

In this episode of 'Deep Learning with PolyAI,' we welcome Shawn Wen, co-founder and CTO at PolyAI. Shawn provides an in-depth overview of the AI tech stack essential for developing high-quality AI voice assistants. Inspired by Andreessen Horowitz's recent publication on AI voice agents, the discussion covers key components of a complex system, including speech recognition, voice activity detection, the application of generative AI models, and the integration of these technologies into practical applications. Shawn also explores the challenges of managing latency, how input affects selected speech recognition models, and the future of end-to-end AI systems. Join us as we unravel the complexities behind creating and optimizing effective voice AI solutions!
00:18 Understanding the AI Tech Stack
00:49 Building and Buying Voice Assistants
02:10 Speech Recognition Challenges
07:29 Voice Activity Detection (VAD)
10:31 Generative AI and Guardrails
15:58 Tooling and Function Calls
22:39 Future of End-to-End Models
#ai #voiceai #texttospeech #asr #aitechnology #deeplearning

Пікірлер

Unreasonably Effective AI with Demis Hassabis

52:00

Unreasonably Effective AI with Demis Hassabis

Google DeepMind

Рет қаралды 156 М.

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

46:02

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

The Royal Institution

Рет қаралды 990 М.

Как Алип забил в свои ворота | #Зенит #Футбол #СПБ

00:28

Как Алип забил в свои ворота | #Зенит #Футбол #СПБ

ЗЕНИТ

Рет қаралды 3,3 МЛН

АЗАРТНИК 4 |СЕЗОН 2 Серия

31:45

АЗАРТНИК 4 |СЕЗОН 2 Серия

Inter Production

Рет қаралды 565 М.

When you discover a family secret

00:59

When you discover a family secret

im_siowei

Рет қаралды 32 МЛН

مسبح السرير #قصير

00:19

مسبح السرير #قصير

سكتشات وحركات

Рет қаралды 11 МЛН

a day in the life of an engineer working from home

8:42

a day in the life of an engineer working from home

Joma Tech

Рет қаралды 20 МЛН

What are AI Agents?

12:29

What are AI Agents?

IBM Technology

Рет қаралды 303 М.

OK. Now I'm Scared... AI Better Than Reality!

8:10

OK. Now I'm Scared... AI Better Than Reality!

AI Revolution

Рет қаралды 165 М.

How to Make Learning as Addictive as Social Media | Luis Von Ahn | TED

12:55

How to Make Learning as Addictive as Social Media | Luis Von Ahn | TED

TED

Рет қаралды 7 МЛН

These Illusions Fool Almost Everyone

24:55

These Illusions Fool Almost Everyone

Veritasium

Рет қаралды 3,4 МЛН

6 Years of Studying Machine Learning in 26 Minutes

26:05

6 Years of Studying Machine Learning in 26 Minutes

Boris Meinardus

Рет қаралды 86 М.

Generative AI in a Nutshell - how to survive and thrive in the age of AI

17:57

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Henrik Kniberg

Рет қаралды 1,9 МЛН

AI and The Next Computing Platforms With Jensen Huang and Mark Zuckerberg

58:38

AI and The Next Computing Platforms With Jensen Huang and Mark Zuckerberg

NVIDIA

Рет қаралды 3,6 МЛН

AI Realism Breakthrough & More AI Use Cases

25:52

AI Realism Breakthrough & More AI Use Cases

The AI Advantage

Рет қаралды 115 М.

How to use ChatGPT to learn ANY Language (new update)

13:26

How to use ChatGPT to learn ANY Language (new update)

Matt Brooks-Green

Рет қаралды 38 М.

Как Алип забил в свои ворота | #Зенит #Футбол #СПБ

00:28

Как Алип забил в свои ворота | #Зенит #Футбол #СПБ

ЗЕНИТ

Рет қаралды 3,3 МЛН