Love it, I am still waiting for the production-like video :)
@MaliRasko 3 months ago
Talked with it for 5 min in the playground today. The cost was $2.35. Not too shabby.
@i2Sekc4U 3 months ago
That’s pretty expensive. Especially if you wanted to build something with this for consumers, think about how pricy it would get. Monthly subscriptions would have to be like $50
@ChrizzeeB 3 months ago
That's a 1990s sexline... What service would work at that price?
@yurijmikhassiak7342 3 months ago
The price is $20/hour. Like a junior sales rep.
@danacarvey 3 months ago
What I want to know is if you can interrupt it?
@yurijmikhassiak7342 3 months ago
@MaliRasko yes you can interrupt it, and it has automatic voice detection. So you pay only for the time you speak, not for silence. Still $20 for an hour of conversation requires a solid use case.
@BrianDevJourney 3 months ago
Great tool, if this was cheaper I would develop with it. Also, just emailed you about a sponsor opportunity. Cheers!
@DevelopersDigest 3 months ago
Cheers - I’ll have a look. Agree, I think as the price comes down it will be much more viable for more apps
@BrianDevJourney 3 months ago
@DevelopersDigest Hey Developers Digest, following up here. Did you see my email? Thanks!
@kelvindimson 3 months ago
This is crazy!!
@adityakale55 2 months ago
How do you end the call, and how do we know when the last audio has been played?
@nhtna4706 3 months ago
What would the API usage cost be in a scenario where the call volume reaches around 200,000 minutes in a given month? That's on average, because it involves calls that go on for hours and tens of thousands of calls.
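A rough way to size that question, using only the per-minute rates implied by figures other commenters quote in this thread ($2.35 for 5 minutes in the playground, and $20 per hour). These are anecdotal numbers, not official pricing, and the function name `monthly_cost` is just an illustrative helper:

```python
def monthly_cost(minutes: float, dollars_per_minute: float) -> float:
    """Projected spend for a month of conversation at a flat per-minute rate."""
    return minutes * dollars_per_minute


# Per-minute rates implied by figures quoted in this thread.
# Assumption: these are anecdotal, not published pricing.
RATES = {
    "playground report ($2.35 per 5 min)": 2.35 / 5,
    "hourly estimate ($20 per hour)": 20 / 60,
}

if __name__ == "__main__":
    for label, rate in RATES.items():
        cost = monthly_cost(200_000, rate)
        print(f"{label}: about ${cost:,.0f} per month")
```

Actual spend would depend on how much of each call is billed audio, so treat the result as an order-of-magnitude guess at best.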
@mizukireview1717 12 days ago
Can this model read files?
@manoharants 2 months ago
When I give a phone number as voice input, the numbers get mixed up. Could you help me?
@SirHelios 1 month ago
I have the same issue; it also has difficulty understanding the last name. Twilio was more accurate.
@seecmellikew 1 month ago
Any luck deploying?
@DevelopersDigest 1 month ago
I haven’t had a chance to circle back to this yet! I did see Cloudflare had a really nice-looking relay for this, though, that I have been meaning to try!
@jaysonp9426 3 months ago
What was the latency? Also is there a way to have it await the function call return via the websocket? Def a non starter if we just have to deal with it coming back in pieces
@micbab-vg2mu 3 months ago
thanks :)
@DevelopersDigest 3 months ago
Thanks for watching!
@ibrahimaba8966 3 months ago
This API is too expensive; I think we should avoid sending all chunks. We need a local VAD (Voice Activity Detection) to send only the chunks that contain voice; otherwise, it could become costly.
@ivan3584 1 month ago
Always getting 429 (too many requests).
@DevelopersDigest 1 month ago
Oh interesting - I hadn’t thought about the rate limit for this offering. I haven’t run into any issues yet
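For transient 429s, the usual workaround is retrying with exponential backoff. Below is a minimal sketch of that general pattern, not anything specific to this API: `RateLimited` and `with_backoff` are hypothetical names, and it assumes your own client code raises `RateLimited` when it sees an HTTP 429. It will not help if the account is over a hard quota.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical error your own client code raises on an HTTP 429."""


def with_backoff(call, max_attempts=5, base_delay=1.0, max_delay=30.0,
                 sleep=time.sleep):
    """Run call(); on RateLimited, wait 1s, 2s, 4s, ... (capped) and retry."""
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error to the caller
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Up to 10% random jitter so parallel clients do not retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
```

Usage would look like `with_backoff(lambda: connect_to_api())`, where `connect_to_api` stands in for whatever function opens your session.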
@nastastic 3 months ago
can you make a cartoon character voice with it?
@johnnylarue3933 3 months ago
Crazy expensive: 3.6 min cost $12.
@AI_Escaped 3 months ago
Yup, just tinkering around to figure out how things work will drain your account. I don't see many people using this unless they have huge funding. Guess most of us will have to wait for open source or for OpenAI to drop the price later. Horrible pricing, OpenAI.
@AI_Escaped 3 months ago
And the voice sounds like crap
@johnnylarue3933 3 months ago
@AI_Escaped I'm sure it's going to drop in price a year from now... but I was hoping to start using this today for many use cases... Like many others, I cobbled together a version of this using VAD, STT and TTS to/from GPT Chat Completions. It wasn't overly fast on the initial response (3-6 seconds), but otherwise it was a decent two-way conversation. I am going to try handling VAD and STT myself (sending text is 1/10th the cost) to see whether converting to text is a worthwhile tradeoff for the lower cost.
@ibrahimaba8966 3 months ago
It’s normal; this system sends everything to the model, even if you’re not saying anything. It keeps filling the buffer, so we need to add a local VAD.
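A rough sketch of the kind of local gating described here, assuming 16-bit mono PCM chunks in the machine's native byte order. It uses a simple energy threshold rather than a real VAD model, and `rms`, `voiced_chunks`, `threshold` and `hangover` are illustrative names and values, not part of any API. Whether this actually lowers the bill depends on how the service charges for buffered audio, which this thread does not settle.

```python
import array
import math


def rms(chunk: bytes) -> float:
    """Root-mean-square amplitude of a 16-bit PCM chunk (even byte length)."""
    samples = array.array("h")  # signed 16-bit, native byte order
    samples.frombytes(chunk)
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def voiced_chunks(chunks, threshold=500.0, hangover=3):
    """Yield chunks whose energy reaches threshold, plus a short quiet tail.

    The tail ("hangover") keeps a few chunks after speech stops so that
    word endings are not clipped.
    """
    quiet_run = hangover  # start in the silent state
    for chunk in chunks:
        if rms(chunk) >= threshold:
            quiet_run = 0
            yield chunk
        elif quiet_run < hangover:
            quiet_run += 1
            yield chunk
        # otherwise: sustained silence, drop the chunk instead of sending it
```

In practice you would wrap your microphone chunk iterator with `voiced_chunks(...)` and send only what it yields; the threshold needs tuning per microphone and room.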