Andrew Seagaves, VP of Research at Deepgram | AIMinds

  Рет қаралды 59

Deepgram

Deepgram

Күн бұрын

In this episode we are joined by our very own Andrew Seagaves, VP of Research at Deepgram, explores text-to-speech (TTS) technology and language modeling. With a PhD from MIT and a background in AI-driven explosive design, Andrew now leads advanced speech recognition research. He discusses the challenges of creating natural-sounding TTS systems, the role of context conditioning, and his career journey from MIT to Deepgram.
Episode Highlights:
- Andrew Seagaves shares his insights on why language modeling poses such a complex challenge, particularly in the domain of text-to-speech systems.
- Seagaves discusses how future developments promise to address these issues dramatically.
- From his initial steps at Deepgram working on speech recognition and diarization, to his current focus on scaling models for varied languages and contexts-discover Andrew Seagaves' transformative journey in AI.
- Andrew’s fascinating career trajectory, from designing defense technologies at MIT to spearheading voice technology innovations used by global leaders like Spotify and NASA.
- Demetrios and Seagaves express excitement for the near future of TTS technology, hinting at groundbreaking features that will redefine our interaction with digital devices.
-------------------------------------------------------------
Connect with Andrew Seagaves
/ seagravesan
Connect with Demetrios:
/ dpbrinkm
Connect with Deepgram:
deepgram.com/
/ deepgram
x.com/deepgramai

Пікірлер: 5
@lets-talk-ai
@lets-talk-ai 17 күн бұрын
Loved it!
@mgevirtz
@mgevirtz 20 күн бұрын
Best AI show I've seen to date. The STT as a information discarder is a great observation.
@lets-talk-ai
@lets-talk-ai 17 күн бұрын
Trueee
@mgevirtz
@mgevirtz 20 күн бұрын
How will the asymmetry between languages in training data affect AI and its business uses in the 5-10 year time scale?
@scott_stephenson
@scott_stephenson 19 күн бұрын
The biggest change will be the ability to generate expressive data in any language (via audio generation, the most constrained version of that being TTS), as a way to produce massive scale datasets for any language.
Derek Wang, Co-founder at Taalk | AIMinds #035
27:48
Deepgram
Рет қаралды 57
The Turing Lectures: The future of generative AI
1:37:37
The Alan Turing Institute
Рет қаралды 602 М.
Пришёл к другу на ночёвку 😂
01:00
Cadrol&Fatich
Рет қаралды 10 МЛН
escape in roblox in real life
00:13
Kan Andrey
Рет қаралды 74 МЛН
Teaching a Toddler Household Habits: Diaper Disposal & Potty Training #shorts
00:16
大家都拉出了什么#小丑 #shorts
00:35
好人小丑
Рет қаралды 95 МЛН
From Disabilities to Dream Analysis!
0:59
Deepgram
Рет қаралды 9
Mind Control Technology
50:27
Risk Group LLC
Рет қаралды 9 М.
Andrew Yang's Plan For Black America
29:05
The Root
Рет қаралды 470 М.
The Minister's Millions I Al Jazeera Investigations
25:12
Al Jazeera English
Рет қаралды 1,7 МЛН
MIT Bitcoin Expo 2022: Breaking Through - Fireside Chat, Michael Saylor
46:44
Intel CEO Pat Gelsinger on Intel's place in the semiconductor industry
53:54
Manufacturing @ MIT
Рет қаралды 36 М.
There are monsters in your LLM.
2:15:23
Machine Learning Street Talk
Рет қаралды 73 М.
Do you think that ChatGPT can reason?
1:42:28
Machine Learning Street Talk
Рет қаралды 63 М.
Пришёл к другу на ночёвку 😂
01:00
Cadrol&Fatich
Рет қаралды 10 МЛН