Yes!! replace the traditional TTS! Please bring this in google play books! I would love to have my books being read to me like an audio book! Game changer!
@alias_ansuz3336Күн бұрын
Up!
@kromanfr20 сағат бұрын
@@alias_ansuz3336it's already possible with IIReader
@maxcomperatoreКүн бұрын
the speed of this is astonishing
@Tinman462Күн бұрын
This is how the world ends... one perfectly-pitched whisper at a time 😊
@sj00100Күн бұрын
Yeah remember when world ended when we had text to speech for years
@vectoralphaSecКүн бұрын
Ill take it.
@DaveK-q9yКүн бұрын
The whisper… all those ASMR youtube videos were useful
@BeepBeepBeepbopКүн бұрын
SOO exited for an alternative for OpenAI advanced voice!!!!
@TECHNOSTARTERSSКүн бұрын
When is not quite seamless you are prompting it to speak that way in the Open I want you don’t have to prompt it. It automatically adapts to you and it’s voice to voice speech to speech. This one seems like text to speech.
@raydosson2025Күн бұрын
@@TECHNOSTARTERSS this one is not text to speech. that's why the title is "Native audio output".
@UrYesMan5 сағат бұрын
This suits Google a lot. Google's been making Gemini very humanlike and now with this, their vision is even clearer.
@hahoang9542Күн бұрын
Its the early Christmas gift from Google
@wassimharzli33615 сағат бұрын
I think with this, Gemini will become the number one Ai outperforming OpenAi
@AIrtesanКүн бұрын
And you even change the order of the speakers, making the female voice lead. Kudos to the CX team. Wittily played!
@michaelcharlesthearchangelКүн бұрын
A man of Native American ancestry has been feeding all AI developers for the last decade, behind the scenes.
@aliettienne290721 сағат бұрын
2:31 If Gemni could handle both fast speaking and whisperings it shows how much robust ingenuity and development that went to devising the AI model. 😎💯💪🏾👍🏾
@aron2922Күн бұрын
Her is truly here
@__J____ff23 сағат бұрын
it's him & her .... hhhhhhhhh
@aiforcultureКүн бұрын
Exceptional work 👏 love the example of the model intelligently adapting to fit the speed of reply it thinks you need.
@MomixerКүн бұрын
Yes! Please use this for the different KZbin soundtracks, because right now the generated ones are really bad
@CODE7XКүн бұрын
This isnt new but maybe its better than what was out there before! Cant wait to try it
@friedpizza262Күн бұрын
Whoever made this video is cool
@LaPetiteCuillèreКүн бұрын
when is available ?
@HUEHUEUHEPonyКүн бұрын
As soon as Google kill their older products
@aquilesdg4305Күн бұрын
I think it already is
@JaBigKneeGapКүн бұрын
@@aquilesdg4305 And _where_ exactly is it available?
@Ethereal_EnigmaКүн бұрын
It's available right now in Google ai studio @@aquilesdg4305
@MainInternetUserКүн бұрын
Right now on AI Studio
@OumarDicko-c5iКүн бұрын
I will build my IA girlfriend now 😂
@CODE7XКүн бұрын
Haha yes
@games528Күн бұрын
Ah yes, Irtafacial Antelligence
@aron2922Күн бұрын
@@games528 This is funnier than it should be
@IceMetalPunkКүн бұрын
@@games528 In many languages, the adjective comes after the noun.
@flyingstapler1241Күн бұрын
@@games528 It's called IA in many languages
@ShubharthakSangharshaКүн бұрын
2:32: damnnn ok am I'm impressed 👌 👏
@dliedke18 сағат бұрын
Raining is good for running, be more excited for the rain AI!
@DangRenBo4 сағат бұрын
Can we get word timings output together with the tts? We need closed captions for accessibility.
@trutenantedboderamptКүн бұрын
Great! Now we can hear non-sensical facts from history with native audio output!
@IN-pr3lwКүн бұрын
Google doing what OpenAI said they would months ago but we still didnt get 👏
@cagnazzo82Күн бұрын
Actually with advanced voice I was having it speak english, french, elvish, and simlish in one sentence. The actual game-changer is being able to prompt the AI to do this. You can do this through voice commands with OpenAI, but for some reason ignored the ability to prompt for voices. Plus I think the whole 'her' situation got them rattled from voices almost altogether.
@IceMetalPunkКүн бұрын
? OpenAI Advanced Voice mode is already out
@IN-pr3lw21 сағат бұрын
@@IceMetalPunk It's out but its not the one in the ads. It doesnt seem to be Audio to Audio, it still seems to be Audio to text to audio. I could be wrong but it doesn't feel like what we were shown.
@IceMetalPunk17 сағат бұрын
@IN-pr3lw No, Advanced Voice Mode (called the audio-preview model in the API) is fully native audio output. It can do sound effects, tone shifts, voice differences, accents, etc.
@IN-pr3lw16 сағат бұрын
@IceMetalPunk I cant get to to whisper or anything on mine 🤷♂️
@niceplace12323 сағат бұрын
Look amazing, but did anyone get it to work in the actual AI studio? I ran into a ton of bugs, especially with non-English languages.
@flamyf23 сағат бұрын
0:01 Has anyone find "Video understanding" demo? All other topics have a video on this channel
@demonsynthКүн бұрын
Mind blown. Playing with it now :)
@janjahrademusicКүн бұрын
haha yoo that's dope ..well done
@jab396612 сағат бұрын
if I have access, would this only be in iastudio? Or could I use it in code to run other tests, such that the output is audio?
@ThomasOberhoffКүн бұрын
This will put so many call-center agents out of work worldwide
@Chaotic-n5nКүн бұрын
Bro this thing is crazyyy 😱
@DanielMKКүн бұрын
Now that's impressive
@TerrantullaКүн бұрын
I cant help myself but feel like the next decade is going to get very weird
@NutriQlikAI-e4e15 сағат бұрын
what's the point of releasing 2.0 when all the features are not available to test .. Note: Image and audio generation are in private experimental release, under allowlist. All other features are public experimental.
@MichealAngeloArtsКүн бұрын
I don't have the "Output Format" and "Voice" options under "Model" in the AI Studio. I just have the "Token Count" immediately after Model.
@MichealAngeloArtsКүн бұрын
I've just figured it out as I have to change from "Create Prompt" to "Stream Realtime" in the left pane. However I can't seem to change the audio effect. Whispering doesn't work with me although it is demonstrated in the Google post. How can we add these audio effects?
@KiririnКүн бұрын
is the model in the video the flash version? i am unable to get it to whisper or laugh or change how it speaks
@shadydragon22Күн бұрын
Same here
@ShawnFumoКүн бұрын
I think that's the part that is available in January. It is a bit confusing since they ended with saying to go to ai studio...
@shadydragon22Күн бұрын
@@ShawnFumo Oh ok I see! Thanks for clarifying
@RichardPinewoodКүн бұрын
level 4 AI is the next big thing, thats when Science gonna get intresting 😎
@GeneralKenobi69420Күн бұрын
That thumbnail goes hard
@KenykoreКүн бұрын
This is so lovely
@999satyamКүн бұрын
ok that Hindi was nice, damn. Is there a paper on this?
@phiarchitectКүн бұрын
nicely done
@BroskiPlays22 сағат бұрын
This is AVM but with less restrictions
@RickySupriyadi17 сағат бұрын
how much? will I be able to afford them?
@devagarwal3250Күн бұрын
woah this is so cool
@MemeConnoisseurКүн бұрын
who's going to fill the hollow the emptiness? idk something is super weird bout ai generated audio trying to be friendly and humanly..
@AnonymousNyanCat-qg6bb17 сағат бұрын
There is a noticeable difference between the Gemini and the GPT. I am happier with GPT. No doubt, thanks.
@FwuzeemКүн бұрын
How do we get it?
@rakeshkumarrout2629Күн бұрын
lets start building with gemini 2.0
@Mediiiicc22 сағат бұрын
Weird how we can still tell the voice is AI. The lack of errors makes it noticable. Just like how a jewler can tell apart real and fake diamonds since real diamonds have imperfections that fake diamonds dont have.
@AyyazZafarКүн бұрын
I tried but it does not whisper yet.
@snowhan7006Күн бұрын
incredible❤❤
@DistortedV12Күн бұрын
GOOGLE ships! Pixel phone stocks jumping!
@Happ1nessКүн бұрын
Hopefully it's not another lie. We all remember the Gemini "hands on demo".
@DarxKies21 сағат бұрын
That was fun!
@Blooper1980Күн бұрын
Pretty epic
@ruchirahasaranga8076Күн бұрын
it does not support Sinhala language!
@pandoraeeris7860Күн бұрын
I need an agent that can use any program on my computer. Just give us AIOS.
@J3R3MI6Күн бұрын
Exactly
@CODE7XКүн бұрын
Exactly, but yes its already out , but for browser so far , and not released yet .... I hope google releases one :0
@ShpanManКүн бұрын
Nothing that OpenAI's model can't do so far, but hey more competition is better for everyone!
@IceMetalPunkКүн бұрын
Hopefully it has cheaper API access. I blew through so much money just testing a few use cases of the OpenAI audio model through the API.
@DominickZollinger-e3rКүн бұрын
❤
@mightynathaniel5355Күн бұрын
Would be better and more impressive if it kept the same voice or character when switching languages rather than using a totally different voice for each language. But all fun and looking forward to using this model.
@ROHIT-wx4nuКүн бұрын
This is how tts ends😂😂😂
@1brokkolibaum23 сағат бұрын
I wonder why I am able to use it on my pc, but my phone doesnt have 2.0 unlocked 😮💨
@vectoralphaSecКүн бұрын
AGI is coming soon 2025
@JaBigKneeGapКүн бұрын
Dude, I swear. Like, that clock slaps 12 on, idk, january 5? I swear AGI will be here. Or anytime afterward.
@3thinking15 сағат бұрын
Wild!
@stochastic8417 сағат бұрын
I thought the voice felt very unnatural, and then the video answered why lol.
@braineaterzombie3981Күн бұрын
Gimme my sandevistan , time to get chromed up
@pathringКүн бұрын
한국어 스피치는 조금 부자연스럽군요
@lakshiBroКүн бұрын
Oh well.
@MidgarMerc23 сағат бұрын
Surely this won't be used to cause suffering at the expense of talented voice actors just so rich creeps get even richer. Surely.
@4letterdcКүн бұрын
hell yeah
@InternetKilledTV21Күн бұрын
Oh Calculon
@andreaserrano380922 сағат бұрын
Being playing with it and only supports english lol... I guess this is just a smaller demo model to show for now
@eve_mtpl18 сағат бұрын
It sounds like an hypocritical IA, can we go back to the i sound like an IA IA
@AstroZoe1804Күн бұрын
I love it
@dfas1497tcf317 сағат бұрын
시도해 봤는데, 아직은 적용안됨.
@ashleigh3021Күн бұрын
I don’t like the tone, cadence structure. They should call it “podcast voice”
@Selene-xf9yi19 сағат бұрын
Cool
@BoydLIN-c3wКүн бұрын
I haven’t found Chinese 😂
@notvedxpКүн бұрын
😮
@donny603613 сағат бұрын
this is way too natural. i have to keep telling myself that this is generated.
@alejandromedina10199 сағат бұрын
banger
@joelcarter9137Күн бұрын
Wow! That is completely pointless!
@bluepandamanКүн бұрын
What.. are you even talking about. How is this pointless?