Рет қаралды 1,789
Kyutai's Moshi is a new open source real time voice model, that is substantially better than GPT-4o, and Gemini. It can answer in real time, and can even interrupt you while you're talking if it wants to answer already. Is this the future of smart assistants, like Google Gemini, Amazon Alexa and Apple Siri? This model is mind blowing. It's voice is extremely flexible, and can express over 70 different emotions.
Links:
Moshi Chat Demo: bit.ly/45TSRrI
Kyutai: bit.ly/4csNXEo
Previous videos:
What OpenAI didn't show you: • What OpenAI DIDN'T tel...
Chapters:
00:00 - Intro
01:25 - What can it do?
02:57 - Working offline
03:53 - How did they do that?
05:36 - Who is Kyutai?
06:55 - Safety First
08:32 - What can it do right now?
09:55 - Summary