CONTROL your Personal AI Assistant with GPT-4o mini & ElevenLabs (AI TTS & STT)

  Рет қаралды 6,701

IndyDevDan

IndyDevDan

Күн бұрын

Пікірлер: 38
@WeeklyTubeShow2
@WeeklyTubeShow2 5 ай бұрын
That first one may as well have called you senpai. 😂
@jorgeconsulting
@jorgeconsulting 5 ай бұрын
Your content is top notch man. It’s really easy to digest.
@lakergreat1
@lakergreat1 5 ай бұрын
the #1 video I look forward to each week, sent you an email as well
@ew3995
@ew3995 5 ай бұрын
would it be possible to further reduce latencies by streaming transcription and streaming response
@jorgeconsulting
@jorgeconsulting 5 ай бұрын
That’s what I was thinking too
@brianmorin5547
@brianmorin5547 5 ай бұрын
Same thing I’ve been playing with these week. I want to carve out some time to take chunks from the streaming to create a loop of sending and receiving from the TTS API for super low latency I think the biggest problem is maintaining continuity of the voice so perhaps render the first sentence locally and while it plays assemble the next audio file or two server-side then send back?????
@thetagang6854
@thetagang6854 5 ай бұрын
Amazing, so much value in a PA. Can't wait for speech-to-speech models to come to the market, super natural convos.
@ronisaroniemi8501
@ronisaroniemi8501 5 ай бұрын
Amazing content - keep up this combo of practical + high level videos 💪
@christyson4245
@christyson4245 5 ай бұрын
As we say in the UK - this is the dogs bollox! Fantastic work as always Dan.
@6lack5ushi
@6lack5ushi 5 ай бұрын
you can get faster calls using Groq new LARGE whisper, and Llama 3.1 70bn if you stream the audio ASAP and have tiny chunk sizes you can down to sub 1 second responses
@zipaJopa
@zipaJopa 5 ай бұрын
Do you have any repos on hand that showcases this setup? Many thanks!
@6lack5ushi
@6lack5ushi 5 ай бұрын
@@zipaJopa I can dig one up and put it on git gimme a few hours, I have an iOS version but it’s faffy. Will reduce it to python and write a simple readme
@6lack5ushi
@6lack5ushi 5 ай бұрын
​@@zipaJopa Just rewrote the script will post it to GIT in a few mins and share the link here. (y) might make a video on it actually... Thanks for asking
@zipaJopa
@zipaJopa 5 ай бұрын
@@6lack5ushi my hero! 💕
@pajarobobo4467
@pajarobobo4467 5 ай бұрын
@@6lack5ushiwheres the link?
@ytubeanon
@ytubeanon 4 ай бұрын
that's neat, I've been trying to get TTS to work for open-interpreter which I use a lot with gpt-4o-mini
@TimothyJoh
@TimothyJoh 5 ай бұрын
This was so great. I left you a PR on the repo, waiting for your feedback. Would love to demo it for you.
@yuniorgonzalez4638
@yuniorgonzalez4638 4 ай бұрын
Amazing job !!!
@ModernCentrist
@ModernCentrist 5 ай бұрын
What would be the main difference between the custom voice assistant and the OpenAI voice mode?
@seventhapex
@seventhapex 5 ай бұрын
do you know what the word "own" means?
@YorkyPoo_UAV
@YorkyPoo_UAV 5 ай бұрын
Is there a way to run this on your mobile device and have it in a call/open conversation function?
@andrewandreas5795
@andrewandreas5795 5 ай бұрын
Does anybody know a STT model that could be run locally for live transcript?
@ariramkilowan8051
@ariramkilowan8051 5 ай бұрын
Love the content but I think it would be helpful to at least mention relative costs. You've said in the past that the API costs are worth the investment. You are likely correct but probably still worth mentioning. Thanks again for the content.
@mrd6869
@mrd6869 5 ай бұрын
The top area for this will be Cybersecurity hands down. In years time there will be entire cyberwars being waged and run by AI systems.
@Mosen_xd
@Mosen_xd 4 ай бұрын
perfect
@EternalKernel
@EternalKernel 5 ай бұрын
Eleven labs pricing structure is prohibitive. Very prohibitive.
@BaldyMacbeard
@BaldyMacbeard 5 ай бұрын
"Personal AI is TOO valuable to leave in the hands of Big Tech"... proceeds to build his AI assistant with APIs hosted by "big tech". Kinda expected to see whisper, llama, mycroft or sth.
@littledovecitydust
@littledovecitydust 5 ай бұрын
this is just a voiceflow type chatbot, it's not your own assistant.
@orthodox_gentleman
@orthodox_gentleman 4 ай бұрын
Good point. I don’t really understand the point of this.
@itskittyme
@itskittyme 5 ай бұрын
cringe lol
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 30 МЛН
Automate EVERYTHING Through ChatGPT ✨
29:13
No-Code Ireland
Рет қаралды 41 М.
AI Copyright Claimed My Last Video
24:11
Venus Theory
Рет қаралды 734 М.
PydanticAI Agents that Code
15:09
Riza, Inc.
Рет қаралды 2,6 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,8 МЛН
I rigged ChatGPT's new memory into my personal JARVIS
20:52
I versus AI
Рет қаралды 10 М.
GPT-4o is WAY More Powerful than Open AI is Telling us...
28:18
MattVidPro AI
Рет қаралды 277 М.
Windsurf vs Cursor: In-Depth AI Code Editor Comparison
18:14
Yifan - Beyond the Hype
Рет қаралды 23 М.
Cat mode and a glass of water #family #humor #fun
00:22
Kotiki_Z
Рет қаралды 42 МЛН