That first one may as well have called you senpai. 😂
@jorgeconsulting 5 months ago
Your content is top notch man. It’s really easy to digest.
@lakergreat1 5 months ago
the #1 video I look forward to each week, sent you an email as well
@ew3995 5 months ago
Would it be possible to further reduce latency by streaming both the transcription and the response?
@jorgeconsulting 5 months ago
That’s what I was thinking too
@brianmorin5547 5 months ago
Same thing I’ve been playing with this week. I want to carve out some time to take chunks from the stream and loop them through the TTS API, sending and receiving, for super low latency. I think the biggest problem is maintaining continuity of the voice, so perhaps render the first sentence locally and, while it plays, assemble the next audio file or two server-side, then send them back?
@thetagang6854 5 months ago
Amazing, so much value in a PA. Can't wait for speech-to-speech models to come to the market, super natural convos.
@ronisaroniemi8501 5 months ago
Amazing content - keep up this combo of practical + high level videos 💪
@christyson4245 5 months ago
As we say in the UK - this is the dog's bollocks! Fantastic work as always, Dan.
@6lack5ushi 5 months ago
You can get faster calls using Groq's new large Whisper and Llama 3.1 70B. If you stream the audio ASAP and use tiny chunk sizes, you can get down to sub-1-second responses.
@zipaJopa 5 months ago
Do you have any repos on hand that showcase this setup? Many thanks!
@6lack5ushi 5 months ago
@zipaJopa I can dig one up and put it on Git, gimme a few hours. I have an iOS version but it’s faffy. Will reduce it to Python and write a simple readme.
@6lack5ushi 5 months ago
@zipaJopa Just rewrote the script, will post it to Git in a few mins and share the link here. (y) Might make a video on it actually... Thanks for asking
@zipaJopa 5 months ago
@6lack5ushi my hero! 💕
@pajarobobo4467 5 months ago
@6lack5ushi where's the link?
@ytubeanon 4 months ago
that's neat, I've been trying to get TTS to work for open-interpreter which I use a lot with gpt-4o-mini
@TimothyJoh 5 months ago
This was so great. I left you a PR on the repo, waiting for your feedback. Would love to demo it for you.
@yuniorgonzalez4638 4 months ago
Amazing job !!!
@ModernCentrist 5 months ago
What would be the main difference between the custom voice assistant and the OpenAI voice mode?
@seventhapex 5 months ago
do you know what the word "own" means?
@YorkyPoo_UAV 5 months ago
Is there a way to run this on your mobile device and have it in a call/open conversation function?
@andrewandreas5795 5 months ago
Does anybody know an STT model that could be run locally for live transcription?
@ariramkilowan8051 5 months ago
Love the content but I think it would be helpful to at least mention relative costs. You've said in the past that the API costs are worth the investment. You are likely correct but probably still worth mentioning. Thanks again for the content.
@mrd6869 5 months ago
The top area for this will be Cybersecurity hands down. In years time there will be entire cyberwars being waged and run by AI systems.
@Mosen_xd 4 months ago
perfect
@EternalKernel 5 months ago
ElevenLabs' pricing structure is prohibitive. Very prohibitive.
@BaldyMacbeard 5 months ago
"Personal AI is TOO valuable to leave in the hands of Big Tech"... proceeds to build his AI assistant with APIs hosted by "big tech". Kinda expected to see Whisper, Llama, Mycroft or sth.
@littledovecitydust 5 months ago
this is just a voiceflow type chatbot, it's not your own assistant.
@orthodox_gentleman 4 months ago
Good point. I don’t really understand the point of this.