OpenAI Realtime Voice API: A 7-Minute Getting Started Guide

17,983 views

Developers Digest

1 day ago

Comments: 32
@vcarrascoring 2 months ago
Love it. I'm still waiting for the production-like video :)
@MaliRasko 3 months ago
Talked with it for 5 min in the playground today. The cost was $2.35. Not too shabby.
@i2Sekc4U 3 months ago
That’s pretty expensive. Especially if you wanted to build something with this for consumers, think about how pricy it would get. Monthly subscriptions would have to be like $50
@ChrizzeeB 3 months ago
That's a 1990s sexline... What service would work at that price?
@yurijmikhassiak7342 3 months ago
The price is $20/hour. Like a junior sales rep.
@danacarvey 3 months ago
What I want to know is if you can interrupt it?
@yurijmikhassiak7342 3 months ago
@MaliRasko yes you can interrupt it, and it has automatic voice detection. So you pay only for the time you speak, not for silence. Still $20 for an hour of conversation requires a solid use case.
@BrianDevJourney 3 months ago
Great tool, if this was cheaper I would develop with it. Also, just emailed you about a sponsor opportunity. Cheers!
@DevelopersDigest 3 months ago
Cheers - I’ll have a look. Agree, I think as the price comes down it will be much more viable for more apps
@BrianDevJourney 3 months ago
@DevelopersDigest Hey Developers Digest, following up here. Did you see my email? Thanks!
@kelvindimson 3 months ago
This is crazy!!
@adityakale55 2 months ago
How do you end a call, and how do we know when the last audio has been played?
@nhtna4706 3 months ago
What would the API usage cost in a scenario where call volume averages around 200,000 minutes in a given month? It involves tens of thousands of calls, some of which go on for hours.
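A rough way to approach this volume question is to turn the figures quoted in this thread into a per-minute rate and multiply. The sketch below assumes the commenters' numbers ($2.35 for a 5-minute session, and roughly $20 per hour) hold at scale, which nothing here confirms; the function names are illustrative only.

```python
def per_minute_rate(session_cost_usd: float, session_minutes: float) -> float:
    """Convert one observed session into a dollars-per-minute rate."""
    return session_cost_usd / session_minutes


def monthly_cost(minutes_per_month: float, rate_per_minute: float) -> float:
    """Project a monthly bill from total call minutes and a per-minute rate."""
    return minutes_per_month * rate_per_minute


# Rates taken from comments in this thread, not from official pricing.
playground_rate = per_minute_rate(2.35, 5)   # $2.35 for 5 minutes
hourly_quote_rate = per_minute_rate(20, 60)  # "about $20/hour"

for label, rate in [("playground session", playground_rate),
                    ("$20/hour estimate", hourly_quote_rate)]:
    total = monthly_cost(200_000, rate)
    print(f"{label}: ${rate:.2f}/min, about ${total:,.0f} for 200,000 minutes")
```

Actual spend depends on how much of each call is billable audio, so treat both projections as order-of-magnitude estimates.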
@mizukireview1717 12 days ago
Can this model read files?
@manoharants 2 months ago
When I give a phone number as voice input, the numbers get mixed up. Could you help me?
@SirHelios 1 month ago
I have the same issue; it also has difficulty understanding the last name. Twilio was more accurate.
@seecmellikew 1 month ago
Any luck deploying?
@DevelopersDigest 1 month ago
I haven’t had a chance to circle back to this yet! I did see Cloudflare had a really nice-looking relay for this, though, that I have been meaning to try!
@jaysonp9426 3 months ago
What was the latency? Also is there a way to have it await the function call return via the websocket? Def a non starter if we just have to deal with it coming back in pieces
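On the question of function calls arriving in pieces: one common approach is to buffer the argument fragments per call and act only once the call is marked complete. The sketch below assumes events shaped like `response.function_call_arguments.delta` (with `call_id` and `delta`) and `response.function_call_arguments.done` (with `call_id` and optionally `arguments`); check those names and fields against the current API reference before relying on them. The `FunctionCallAssembler` class is hypothetical.

```python
import json


class FunctionCallAssembler:
    """Buffers streamed argument fragments until a call is complete."""

    def __init__(self):
        self._buffers = {}  # call_id -> list of argument string fragments

    def feed(self, event: dict):
        """Return (call_id, parsed_args) when a call completes, else None."""
        event_type = event.get("type")
        call_id = event.get("call_id")
        if event_type == "response.function_call_arguments.delta":
            self._buffers.setdefault(call_id, []).append(event.get("delta", ""))
            return None
        if event_type == "response.function_call_arguments.done":
            # Prefer the full string if the event carries it; otherwise
            # fall back to joining the fragments collected so far.
            raw = event.get("arguments") or "".join(self._buffers.get(call_id, []))
            self._buffers.pop(call_id, None)
            return call_id, json.loads(raw)
        return None  # unrelated event types are ignored


assembler = FunctionCallAssembler()
events = [
    {"type": "response.function_call_arguments.delta", "call_id": "c1", "delta": '{"city":'},
    {"type": "response.function_call_arguments.delta", "call_id": "c1", "delta": ' "Paris"}'},
    {"type": "response.function_call_arguments.done", "call_id": "c1"},
]
results = [r for r in map(assembler.feed, events) if r is not None]
print(results)
```

With this shape, the rest of the application sees one complete call at a time and never has to handle partial JSON.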
@micbab-vg2mu 3 months ago
thanks :)
@DevelopersDigest 3 months ago
Thanks for watching!
@ibrahimaba8966 3 months ago
This API is too expensive; I think we should avoid sending all chunks. We need a local VAD (Voice Activity Detection) to send only the chunks that contain voice; otherwise, it could become costly.
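The local gating idea in this comment could be sketched with a simple energy threshold. This is not a real VAD model, only a minimal stand-in that assumes 16-bit mono PCM chunks in native byte order; the threshold, the hangover length, and the function names are arbitrary illustrative choices.

```python
import array
import math


def rms(chunk: bytes) -> float:
    """Root-mean-square amplitude of a 16-bit PCM chunk."""
    samples = array.array("h")
    samples.frombytes(chunk)
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def voiced_chunks(chunks, threshold=500.0, hangover=2):
    """Yield only chunks that look like speech.

    A chunk passes if its RMS meets the threshold. After speech, up to
    `hangover` quiet chunks are still passed so word endings are not cut off.
    """
    remaining = 0
    for chunk in chunks:
        if rms(chunk) >= threshold:
            remaining = hangover
            yield chunk
        elif remaining > 0:
            remaining -= 1
            yield chunk


# Synthetic demo: one loud chunk surrounded by silence.
silence = array.array("h", [0] * 160).tobytes()
loud = array.array("h", [4000, -4000] * 80).tobytes()
stream = [silence, loud, silence, silence, silence, silence]
to_send = list(voiced_chunks(stream))
print(f"sending {len(to_send)} of {len(stream)} chunks")
```

A fixed energy threshold misfires in noisy rooms, so a production setup would more likely use a dedicated VAD library and tune it against real recordings.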
@ivan3584 1 month ago
Always getting 429 (too many requests)
@DevelopersDigest 1 month ago
Oh interesting - I hadn’t thought about the rate limit for this offering. I haven’t run into any issues yet
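For the 429 responses mentioned here, a common general-purpose mitigation is to retry with exponential backoff and jitter. The sketch below is generic rather than specific to this API: `RateLimitError` is a hypothetical stand-in for whatever your client raises on HTTP 429, and the delay values are illustrative.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for a client's HTTP 429 error."""


def with_backoff(call, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Run `call`, retrying on RateLimitError with exponential backoff.

    The delay doubles on each attempt, plus random jitter so that many
    clients do not retry in lockstep. The final failure is re-raised.
    """
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            sleep(delay)


# Demo with a fake call that is rate limited twice, then succeeds.
attempts = {"count": 0}

def flaky_connect():
    attempts["count"] += 1
    if attempts["count"] <= 2:
        raise RateLimitError("429 Too Many Requests")
    return "connected"

recorded_delays = []
result = with_backoff(flaky_connect, sleep=recorded_delays.append)
print(result, "after", attempts["count"], "attempts")
```

If the 429s persist even with backoff, the cause may be an account-level quota rather than burst traffic, which retries alone will not fix.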
@nastastic 3 months ago
can you make a cartoon character voice with it?
@johnnylarue3933 3 months ago
Crazy expensive: 3.6 minutes cost $12.
@AI_Escaped 3 months ago
Yup, just tinkering around to figure out how things work will drain your account. I don't see many people using this unless they have huge funding. Guess most of us will have to wait for open source or for OpenAI to drop the price later. Horrible pricing, OpenAI.
@AI_Escaped 3 months ago
And the voice sounds like crap
@johnnylarue3933 3 months ago
@AI_Escaped I'm sure it's going to drop in price a year from now... but I was hoping to start using this today for many use cases. Like many others, I cobbled together a version of this using VAD, STT and TTS to/from GPT Chat Completions, which wasn't overly fast on the initial response (3-6 seconds) but was otherwise a decent two-way conversation. I am going to try handling VAD and STT locally (sending text is 1/10th the cost) to see whether converting to text lowers the cost enough to justify the tradeoff.
@ibrahimaba8966 3 months ago
It’s normal; this system sends everything to the model, even if you’re not saying anything. It keeps filling the buffer, so we need to add a local VAD.
@TéonMèhta 3 months ago
@johnnylarue3933 This is the way.