Using Groq / Mistral AI instead of OpenAI will greatly reduce the latency issue you have in your demo.
@logannon8 ай бұрын
can you fine tune groq?
@AssemblyAI7 ай бұрын
Great suggestion, we will explore this in the next tutorial. This one was meant to be as accessible as possible so that people could build quickly.
@조바이든-r6r6 ай бұрын
@@logannon no its impossible to fine tune groq. thats the problem. you have to use rag instead of fine tuning. but if you wanna make chatbot for specific domain you should try other service
@TrilioniME3 ай бұрын
How much does Mistral API cost?
@fatmayonca17234 ай бұрын
How is it from scratch? You are using 3 Api. Also assembly api doesn't transcribe live audio streams without setting up billing. You have to put minimum 10 dollars in it for that too. I don't have a problem with that. But I have a problem you not telling this in advance, at the start of the video. You actually never mention this in anywhere in the video. It doesn't respond after the introduction. That's how you find out the problem is billing. Not from the video. That was quite annoying to be honest. Potentially great video ruined by lack of transparency.
@yashmehta929914 күн бұрын
If you wish to make an apple pie from scratch, you must first invent the universe - Carl Sagan
@thesohailjafriКүн бұрын
I guess they have "Start building with the $50 free credit!" policy now
@PalashDandge6 ай бұрын
i am getting error "Cannot find reference 'generate' in '__init__.py' " on from elevenlabs import generate, stream line can you please help me to resolve this issue
@JeffreyJohnson-vy1zm8 ай бұрын
Two questions: How can we improve the latency between the patient's response and the AI voice reply? and What can be done for the AI Voice to account for patient input if the patient speaks while the AI voice is speaking?
@AssemblyAI7 ай бұрын
Hi Jeffrey, two very good questions! These deserve a video on their own, to be honest. To improve latency one thing you could try is running the LLM locally so you can get a faster inference over calling openai's API. As for handling overlapping speech, I've written the program to stop listening when the AI voice is responding back. But what you could do, is run another thread that is still listening while the AI voice is speaking.
@EvertvanBrussel7 ай бұрын
As for the latency, I was assuming the majority of the latency was actually coming from ElevenLabs? And likely also from whatever functions might be needed to actually check the availability of the dentist and then also to schedule the actual appointment in the end. Am I wrong? So yeah I think running the LLM locally will surely help, or using Groq, but I'm not convinced yet that that is the biggest bottleneck.
@mehmetbakideniz3 ай бұрын
would you consider adding a web UI like gradio to this app so that we can send the demo to anyone if needed. this version only works if you run the actual code in your own environment.
@simonsandeep49776 ай бұрын
The programming is not responding after the first introduction ,as shown in the video ;though even after using the github code. Any alternative with step by step instruction video ?
@FaisalKhrisan6 ай бұрын
But I still have problems it says that [from elevenlabs import generate, stream ImportError: cannot import name 'generate' from 'elevenlabs'] how come
@Ghosty00695 ай бұрын
i have the exact same error did you fix it ?
@LO-FI_walah_BABA22 күн бұрын
change the version of python to 1.10 or +
@JokerJarvis-cy2sw8 ай бұрын
Please a tutorial on llava vision model to analyze video live with cv2 And I am unable to get my API token from assembly AI website please fix it
@randotkatsenko51575 ай бұрын
Hi nice tutorial. I have coded real-time voice bot for phone conversations in Twilio. The latency comes from text-to-speech mostly and gpt response time. I'm guesing if either ones speed can be reduced about 2-3x, then the response time would be fast enough. In human conversation, we expect the response within 1 second....and anything above that seems more unnatural. I'm sure the speed issues will be solved with new Nvidia GPU-s or other hardware innovations.
@rammohanbethi5 ай бұрын
Hi, can you please let me know how you developed the voice bot using Twilio’s, even I’m looking for such kind of bot. It would be helpful
@randotkatsenko51575 ай бұрын
@@rammohanbethi Hi, how can you let you know - its a lot of complicated server side code in node js and some python... The setup is too complex to explain in a comment. We make this as part of AI automation services for businesses.
@Sibixpur3 ай бұрын
@@randotkatsenko5157 bro speaking as if he coded all the logic voice bot , bruhh you're just hitting API's that ain't complex....
@yitaowang85472 ай бұрын
Thank you! Such a useful application and well explained ❤
@uttamdwivedi77097 ай бұрын
I followed this tutorial then in the end I realized .. assemblyAI doesn't provide the support for the Japanese language in the live Reltimetranscriber. Which sucks .. lol can't use it. Any help? @assemblyAI
@bens44465 ай бұрын
Thanks. First time I hear of AssemblyAI. Everyone talks about faster_whisper and Deepgram. Is AssemblyAI better for STT?
@GoDFazel2Ай бұрын
no its not
@daeralbra7 ай бұрын
The only downside is the fact it takes a while to respond with voice.
@iainhmunro6 ай бұрын
Hi There - I was just looking at the code. Where is the appointment setting details / info coming from ?
@AssemblyAI6 ай бұрын
All that is coming from the LLM we are using, so it's not hard-coded.
@shissncg2 ай бұрын
How do you grab the audio once the RealtimeTranscript has finalized? For example, could you pass the audio rather than the text to generate_ai_response?
@TheBestgoku7 ай бұрын
why not chunk text and output instead of output after all text is generated?
@thebackpainmiracle6 ай бұрын
Exactly what I was intending on making. Thanks!
@MuskaanKhan.314 ай бұрын
Hey there are you learning to create generative ai models If yes please reply I have project for you By creating this project you can practice how to create ai model as well as you can include this in your resume for job search and this will also be help full for me
@avataraang33344 ай бұрын
@@MuskaanKhan.31 I am interested in a project! Just need required data and the objective you have in mind
@jhinaouiroudayna42754 ай бұрын
assembly ai APIs requires a credit card for this task
@nagarajdoddamani697Ай бұрын
in py laptop the brew not installing, and also in program is not working
@abdulazad84322 ай бұрын
Can it be inducted into Aurdino board?
@yuchengpeng77067 ай бұрын
This video is so great! I'm following your video but now I ran into this problem, I can install the package in Pycharm with Windows system, but I got this error: OSError: Cannot find mpv-1.dll, mpv-2.dll or libmpv-2.dll in your system %PATH%. I'm a researcher in the art field with only a debutant python knowledge, could you help me solve this problem? Thanks a lot!
@abibusiness108520 күн бұрын
How to install mpv on windows?
@euginekholmogorov51967 ай бұрын
amazing lady and also an engineer omg)) thank you a million, I'll just add this to my stack
@urekmazino13276 ай бұрын
why are you saying fro. scratch if you're only using api
@theghostyced6 ай бұрын
how would you handle interruptions while the ai is talking?
@sarap.sadegh46917 ай бұрын
hi thanks for your video . i want Api real time conversation with python for Farsi language . the LLM support Farsi language?
@Akash-nb9sv2 ай бұрын
may how to install brew does not have for windows other option for windows
@Alex-qo5je7 ай бұрын
How can i conect to my phone number and google calendar?🙏🏼
@AssemblyAI6 ай бұрын
You can make use of the Google API for google calendar and something like Twilio's API for making phone calls.
@vishalsaichindepalli27988 ай бұрын
For some reason, the microphone isn't picking up my voice. I enabled all permissions on my mac and am still having trouble. Is there any way to fix this?
@michaelnumnum7 ай бұрын
I think you need to pay for the real-time transcription for this at AssemblyAI
@Vrilogs6 ай бұрын
streaming from assembly ai is a paid service. So, first you need add balance into your account. If you have not done that yet. Hope that helps :)
@JR-jorenАй бұрын
nice but the lagging time is too long unfortunately.
@jeevanjaison96466 ай бұрын
The assembly ai api is not free.
@urekmazino13276 ай бұрын
any way to make one with adam voice like the one in elevenlabs?😊
@alifetechgenius38042 ай бұрын
Source code Not Available
@CharlesZulu-v8gАй бұрын
your free api does not work in my project
@mrunexpected108 ай бұрын
can u make just a chat bot word to voice
@pawanmaurya155427 күн бұрын
❤❤❤❤❤so wonderful project
@ac3inlondon5315 ай бұрын
why are you using Mac omg
@viditsharma69907 ай бұрын
i am facing the mpv value error on windows i already installed it many times how can i fix that
@sethuraman98846 ай бұрын
just use vlc instead mpv bro
@조바이든-r6r6 ай бұрын
@@sethuraman9884 thank you guys
@조바이든-r6r6 ай бұрын
or check environment path of mpv. when you command mpv --version on cmd. you have to see its running
@nithishreddy76846 ай бұрын
An error occured: Could not connect to the real-time service: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:997) what to do with this error?
@islamicinterestofficial6 ай бұрын
same error. You found the solution?
@chittisai476 ай бұрын
most likely your microphone is switched off pls check
@rachid69045 ай бұрын
i've got same: An error occured: Could not connect to the real-time service: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1006)
@mehdismaeili37436 ай бұрын
Excellent .
@nickvaleontis4657Ай бұрын
From scratch is misleading as others already commented.
@MiguelCayazaya6 ай бұрын
I am very api to have found this
@Marvinzock34Ай бұрын
No thats not scratch i have no money stop making me hope
@BernardoCastro-eb6rp5 ай бұрын
TOO SLOW !
@drmarioschannel8 ай бұрын
after watching your video, i think i prefer interacting with humans