Create a ChatGPT Voice Assistant in 8 Minutes (Python Tutorial)

  Рет қаралды 466,707

Ai Austin

Ai Austin

Күн бұрын

Пікірлер: 665
@Ai_Austin
@Ai_Austin 5 ай бұрын
Build Your Own GPT-4o Voice Assistant in Python with Groq, Llama3, OpenAI-TTS & Faster-Whisper kzbin.info/www/bejne/ppqZmKWVjq2Gq8U
@Bluem0onJ
@Bluem0onJ Жыл бұрын
I had some problems with the speak and talk part, so it ended up like a chatbot that works with "hotkeys"/command triggers for input to make specific things. Like, command trigger "music" opens a youtube playlist and things like that. I'm happy with the results :) edit: now It can "talk"... I generated some phrases on viocevox, downloaded the audio files and made it play along with the texts in the code at some key points
@axystorm5269
@axystorm5269 Жыл бұрын
can you please share your code with me? im interested in the music part. thank you :D
@SwapnilAngarkhe
@SwapnilAngarkhe Жыл бұрын
you prolly doing it in windwos. the say function doesnt work in windows its the same case as mine.
@thegamingdemon4788
@thegamingdemon4788 Жыл бұрын
Bro, can you please share your code? I kind of need it for a school project. I'll make it worth your time I swear!
@yurialves6237
@yurialves6237 Жыл бұрын
Can you make a video showing how you doing this avatar talk. Cheers
@mulin-ms6qg
@mulin-ms6qg Жыл бұрын
+1
@oladejiolaoluwa9155
@oladejiolaoluwa9155 Жыл бұрын
I would love to know how....
@kevinspyxboy9930
@kevinspyxboy9930 Жыл бұрын
It's actually easy you first generate a face of your choice on midjourney or dall-e and you import that picture in an app like facedance, ifunface, speakpick, etc..
@xlr555usa
@xlr555usa Жыл бұрын
Unreal Engine Metahumans can do it but it is intense, Metahumans looks more realistic also
@highwaynumber1265
@highwaynumber1265 Жыл бұрын
Cheers
@LexifromZargon
@LexifromZargon Жыл бұрын
idk if i commented before but i really enjoy this. its simple to understand and easy to follow especially youre clean code and the use of comments makes it verry easy to code allong and to customize stuff as needed.
@LE6ENDTV
@LE6ENDTV Жыл бұрын
Not sure if anyone asked you this... How did you create your AI visual in place of your face? I'm totally intrigued!
@MarcDimmick
@MarcDimmick Жыл бұрын
Great example, well explained and acutally works. I have tried multiple youtube examples and forever end up in a rabbit hole spiral with chatgpt providing corrections to then only create further errors. I really liked how you explained each function and process. I was a great tutorial in provide clear and precise instructions that were very informative. Thank you.
@Ai_Austin
@Ai_Austin Жыл бұрын
Very glad that helped, Thanks for the feedback Marc! More python tutorials coming!
@PhoenixVids123
@PhoenixVids123 Жыл бұрын
@@Ai_Austin Hi, i just wanted to ask about an error I am getting! I have done pip3 to install everything, and when I run it I get an error saying pyaudio wants installed. I go and do everything "pip3 install pyaudio" "pip install pyaudio" etc. Nothing is working, it does like half of it then says that "Could not build wheels for pyaudio" blah blah blah. Then it says that there's an error with "#include 'portaudio.h' ". Do you know how to fix this?????
@jahnaviaddanki8762
@jahnaviaddanki8762 10 ай бұрын
@@PhoenixVids123 in windows it will show for me like that, so i am using raspberry pi the pip install pyaudio worked on the pi
@marekpalczowski9722
@marekpalczowski9722 Жыл бұрын
I have just tried to do something like this for my program, but you are the first one, thank you very much, great job. Now I will use it for my program. Thank you.
@mcfubumaki8359
@mcfubumaki8359 Жыл бұрын
Sure, here's an improved version of your statement: "I used ChatGPT to analyze the script of this video and engaged in a conversation where ChatGPT3 generated a micro-detailed strategy to guide you through every last detail that you might need to know. In summary, during our conversation, you asked about creating a GPT-3 powered voice assistant with Python. I provided you with a step-by-step guide that covers everything from importing necessary libraries and setting up the OpenAI API key to defining functions for transcribing audio to text, generating responses, and speaking responses. We also discussed the importance of error handling and adding additional features to improve the accuracy and usefulness of the voice assistant."
@deussayajin1441
@deussayajin1441 Жыл бұрын
Opa
@sarkis3791
@sarkis3791 Жыл бұрын
help please it says An error occured: You exceeded your current quota, please check your plan and billing details.
@kubapuchalski8633
@kubapuchalski8633 Жыл бұрын
I understood nothing, but damn IT stuff and programing is fascinating, it would probably take me 1000 years to learn it, that is why all I can do is admire people like you.
@Ai_Austin
@Ai_Austin Жыл бұрын
I bet you could learn it. Its not reserved for some super high iq humans. Checkout the free online book “automate the boring stuff with python”. Give yourself a month. Study it 1-2 hour 3 times a week, this program will look like fluent english!
@kubapuchalski8633
@kubapuchalski8633 Жыл бұрын
@@Ai_Austin thx buddy, I'll give it a try :)
@HeistAlex
@HeistAlex Жыл бұрын
this is a great tutorial!. I really love it if you upgrade it. What i mean by upgrade is that, import the python programme in to any type of device such as arduino or raspberry pi ( If possible). Make it wireless.
@defaultHandle1110
@defaultHandle1110 Жыл бұрын
You can make a iPhone shortcut with the api. It’s one response though, but works really well. Using your api key. No programming needed.
@oklahomajefferson4625
@oklahomajefferson4625 Жыл бұрын
Thanks laddie! I've been scratching my head with assemblyAI for days to make this to work, this went seemingly!
@itsmeark
@itsmeark 8 ай бұрын
i know right assembly ai is so much money
@sebastianbolo2480
@sebastianbolo2480 Жыл бұрын
The avatar is so well made, every one i saw blinks so much but this one does it at a reasonable phase.
@MarcDimmick
@MarcDimmick Жыл бұрын
If you are looking for a topic for your next video, I would love to see you take this to a web interface using flask. I have been trying so many different ways from other videos but always end up in a dark rabbit how with a chatbot, unable to find something that works. It keeps recommending code that breaks in so many ways and loses the original function that was working.
@Gotiplex
@Gotiplex Жыл бұрын
I made a similar code, it s very easy, but you can improve yours by saving in a txt file all the questions and answers so it can memorize what you said before. You just have to give all the content of the file for each request
@mailoisback
@mailoisback Жыл бұрын
How did you create your avatar and his speech? It sounds much better than the pyttsx's generated voice?
@KunjaBihariKrishna
@KunjaBihariKrishna Жыл бұрын
It's 11labs probably
@TungjangpoMusic-yq4rf
@TungjangpoMusic-yq4rf 3 ай бұрын
Can you edit or train speedchrecognition library so that it will able to convert our dialect/unknown language to text
@GraveUypo
@GraveUypo Жыл бұрын
that is the best animated avatar ive ever seen
@r34ct4
@r34ct4 Жыл бұрын
Great tutorial, Austin. Simple, to the point. Would it make sense to upgrade this to the Turbo model now? Also, could you do a tutorial about fine-tuning {prompt: x, response: y} to clone your friends using chat history data?
@Ai_Austin
@Ai_Austin Жыл бұрын
Absolutely. If you just change the engine variable in the open ai function of the code, you can just specify “gpt-3.5-turbo”. Then it will send your prompts to the new version of the API. Fine tuning is absolutely in the video pipeline. Have a few others ahead of it but will be creating a fine tuning tutorial here soon.
@r34ct4
@r34ct4 Жыл бұрын
@@Ai_Austin looks like fine tuning is not yet available for Turbo. I've tried with DaVinci without much success
@sebaccimaster
@sebaccimaster Жыл бұрын
@@Ai_Austin Hmm I'm getting " Engine does not exist" error... What am I overlooking?
@Gotiplex
@Gotiplex Жыл бұрын
@@sebaccimaster it s not working like that you have all another syntax for the completion. Look in internet you will find your answers
@jordanbirley3074
@jordanbirley3074 Жыл бұрын
This can be very beneficial for those who may have a disability. Very cool
@Mansyno
@Mansyno Жыл бұрын
WOW you are an incredible tutor i have been an instructor/teacher for 30 years now and i NEVER seen code writing and concepts explained so clearly and understandable like you just did here explaining and teaching code is not so trivial as many would thing and there are plenty examples for that on the net GREAT video (and note that im not even talking about the specific content itself) keep up the good work, SUBSCIBED
@obanjespirit2895
@obanjespirit2895 Жыл бұрын
c'mon bro really? you've never in you're 30 whole years of teaching never seen it explained better amongst professional teachers? I mean sure the video is informative but c'mon.
@Ai_Austin
@Ai_Austin Жыл бұрын
Try the updated tutorial with GPT 3.5 Turbo, OpenAI Whisper and an open sourced Bing AI API: kzbin.info/www/bejne/l6DOn2duq6djocU
@zeshanasad1414
@zeshanasad1414 Жыл бұрын
Bro Can You Please make a Tutorial video of making The Avatar Video Like You ❣️
@grahamulax
@grahamulax Жыл бұрын
INSANE.....already an update.... hahaa this is nuts man.
@memerified
@memerified Жыл бұрын
J
@HallucinogenUsin
@HallucinogenUsin Жыл бұрын
that ending had me cracked up, thank you for that
@HabiburRahman-lq2kt
@HabiburRahman-lq2kt Жыл бұрын
Can you make it for Android?
@NonnyStrikes
@NonnyStrikes Жыл бұрын
I actually learned some things completely unrelated to the video. Thanks dude.
@todef4ult
@todef4ult Жыл бұрын
how did you create the host animation..please explain
@DexFlex_YT-
@DexFlex_YT- Жыл бұрын
with sr.Microphone() as source: recognizer sr.Recognizer() audio = recognizer.listen(source) its highlightin "sr" as an error and when i run it it says invalid syntax, and when i try to pip install the library it says that its already installed
@FragZ999
@FragZ999 Жыл бұрын
"That sounds like NLE CHOPPA" ahahah you got me there m8 love the video
@pranshusahu7633
@pranshusahu7633 Жыл бұрын
the video was good and i followed it but al last what files Did you download while you were running the programm can you tell and if i want to convert the voice to jarvis's voice how can i do it
@eajahmed
@eajahmed Жыл бұрын
I watched your video and really enjoyed it. Please make another project like this where it will be a mobile application and whenever I call genius it will respond like Siri or google assistant. And if you make a video let me know with a little reply. In the end, I will say one thing, you are a wonderful teacher
@LEHeak
@LEHeak Жыл бұрын
Thanks Mate! Through this i was able to completly copy famous Chatbots like Siri or Alexa and thanks to the python statement "in", i was able to create a bot, who can filter my commands from whole and variable sentences. My Bot almost feels like a human teacher i can ask any question 😁 ... well ... almost ... davinci seems not to be able to tell the correct date and time since both is created from learning and not from actual live data (i asked GPT directly, Davinci refused to give me a usefull answer 😂)
@elijahonewheaton
@elijahonewheaton Жыл бұрын
This is great, and I'd love to try it, but the text is so small and kind of blurred that it's a challenge to make out the code. Will you add it to the description or pinned comment? That'd be really helpful.
@aryang1976
@aryang1976 10 ай бұрын
Can u help me? The code is all correct but there is a huge error saying that google is not an attribute in the module recognizer
@michelleokolocha5374
@michelleokolocha5374 Жыл бұрын
Hi, how do we change the voice to sound a bit like normal voice. And how do we make this work like google AI. For it to come up on our phones when we say 'Hey Genius' Or just call her name.
@MikeKleinsteuber
@MikeKleinsteuber Жыл бұрын
The big problem with this is that chatGPT is only relevant for many queries up to 2021. You really need to make this to interact with Bing Chat which has access to current data.
@Ai_Austin
@Ai_Austin Жыл бұрын
Great idea Mike, I got a Bing AI Voice Assistant Tutorial coming soon. You are right, having access to current data for our voice assistant is a huge improvement and I’m working on getting that out for you guys now! The bing voice assistant I am making will be completely free if you have beta access to bing as well. Unlimited questions.
@MikeKleinsteuber
@MikeKleinsteuber Жыл бұрын
@@Ai_Austin Sounds good, though I noticed this morning I already have both voice input and output available on Bing Chat. Don't know when Microsoft added that. Sadly you have to press the microphone icon to activate whereas it would be much more useful to be able to start with some sort of voice activation like Google assistant (especially if it could be customised). What we really need is something as interactive as in the movie 'Her' (I and many like me would pay a monthly fee for that btw) ....Keep up the good work
@VirginMostPowerfull
@VirginMostPowerfull Жыл бұрын
Even in the comments you talk like a robot my guy.
@jakesmith7428
@jakesmith7428 Жыл бұрын
@@VirginMostPowerfulllmaooo
@mazentahhan2255
@mazentahhan2255 Жыл бұрын
i'm facing some troubles running it, it requires FLAC conversion utility. any suggestions ? btw: i'm using mac..
@arjund1173
@arjund1173 Жыл бұрын
same
@nat.serrano
@nat.serrano Жыл бұрын
How did you make the avatar to talk? (The guy talking). Do you have a guide for yhat?
@Ali-Houssene
@Ali-Houssene Жыл бұрын
Thanks Austin and increase the font size in the editor next time
@El-Peje
@El-Peje Жыл бұрын
looks like you added to the code at the ending didn't show or explain that part 🤔
@Ai_Austin
@Ai_Austin Жыл бұрын
That code is 100% not needed for anything but me to create the video recording. 🤔
@KenDoStudios
@KenDoStudios Жыл бұрын
can you make one for gpt4all?
@mcfubumaki8359
@mcfubumaki8359 Жыл бұрын
Great video! I found it really informative and helpful. Thanks for sharing your knowledge and expertise with us. Looking forward to more videos like this in the future!
@RealWorldMaverick
@RealWorldMaverick Жыл бұрын
Great video. As someone else mentioned, the code is a little small. In future videos would you be able to make it larger so it's easier to read.
@orangesunlabs
@orangesunlabs Жыл бұрын
Very cool, make it and I'll use it, especially would love it if we could upload a Mid-journey etc talking avatar of our choice (or photo that could be adapted).
@braayevangelista1895
@braayevangelista1895 Жыл бұрын
SOOOOO sick dude!! can you teach us how to implement an avatar to integrate it with the gpt responses? such as yours in the video
@aiortairaan5458
@aiortairaan5458 Жыл бұрын
what are the dependencies that are required to be installed for this project?
@roxforgegames4548
@roxforgegames4548 Жыл бұрын
pyttsx3, pyaudio, openai and SpeechRecognition
@cryptombt5880
@cryptombt5880 Жыл бұрын
This is a really good video compared to most on here. Cheers
@roxforgegames4548
@roxforgegames4548 Жыл бұрын
This code works, but it is not optimal. Using speech_recognition to detect the initial command is slow because it requires sending the audio to a server, waiting for the server to process it with a large model, and then receiving the result. Ideally, a pre-trained KWS model that can recognize a single command and runs locally should be used instead.
@hugosilva5842
@hugosilva5842 Жыл бұрын
how would you do that ?
@gaurav_0369
@gaurav_0369 Жыл бұрын
Yes we're curious how to do that
@Gotiplex
@Gotiplex Жыл бұрын
@@hugosilva5842 you can use speech recognition library and it s source code to run it locally, it would be faster, but not that faster...
@alexsyld5410
@alexsyld5410 Жыл бұрын
@@hugosilva5842 this requires only a lil bit of machine learning skills and a PC that costs about 2k
@codylawson7175
@codylawson7175 Жыл бұрын
I need help when I run and say "genius" it says An error ocurred : module 'speech_recognition' has no attribute 'recognize'
@ipigtaiwan
@ipigtaiwan Жыл бұрын
change it to sr.Recognizer()
@1975nikola
@1975nikola Жыл бұрын
After Whisper API release, does it make sense to use Python function for transcribing audio? How much better is Whisper in understanding?
@Ai_Austin
@Ai_Austin Жыл бұрын
That is a great question. Ive yet to find the need to learn whisper. Its my understanding that its superior for language translation and perception of accents. It also isn’t free like the speech recognition method i showed.
@1975nikola
@1975nikola Жыл бұрын
@@Ai_Austin I was testing Whisper over the weekend. It works great - English is excellent, while even small languages are acceptable with an editor. API is not that expensive, you can transcribe a movie for around 0,50 EUR. However, there is also possibility to install it on your server, running it locally and with that it will only cost the price of the infrastructure.
@un0RRS
@un0RRS Жыл бұрын
Had to get rid of the underscore in speech_recognition to get that to work. And I had to run pip install pyaudio to get it to work, but it works. Does this thing have contextual memory? Will it remember by conversations with it? I don't see any logging or context, so I don't think it does.
@bananaear23
@bananaear23 Жыл бұрын
This is really helpful stuff, this level of quality get an insta like from me
@PdWOLFG4NG
@PdWOLFG4NG Жыл бұрын
can you make this with GPT4all? would love to see a video on how to get this running on a offline system since you dont want to be depending on their model, if it gets out of hand we need backup models
@richardtosuto
@richardtosuto Жыл бұрын
What was the process you used to create the speaking animation of your avatar?
@TitanfangArtChannel
@TitanfangArtChannel Жыл бұрын
I'm also interested in how this is done.
@maut276
@maut276 Жыл бұрын
Bro you're just awesome, Can U please make an app like this...
@Ai_Austin
@Ai_Austin Жыл бұрын
Coming very soon! 🫣
@maut276
@maut276 Жыл бұрын
@@Ai_Austin thanks broooo
@sourabhagrawal4098
@sourabhagrawal4098 Жыл бұрын
will it run the same on python or do I need to change something? because I'm trying to run it on Python and it does not run like VSC.
@cammoplayz
@cammoplayz 6 ай бұрын
i need help, it says "Python was not found; run without arguments to install from the microsoft Store, or disable this shortcut from Settings > Manage App Execution Aliases." what do i do??
@al3k
@al3k Жыл бұрын
lol the end :D nice one
@alooflemur1492
@alooflemur1492 Жыл бұрын
Hello, loved the video works wonders. Would you be able to make a video series on how to add other features? such as opening apps, opening websites, setting alarms, adding a todo list & having it speak at cirten times of the day, say you want an alarm at 7am the bot would say good morning (name) today is (Date) with the weather being (weather info) & so forth I think it would be really cool
@sushantkulkarni97
@sushantkulkarni97 Жыл бұрын
Is it possible to retain a session-like memory of previously asked questions with the API like we can do on the ChatGPT web interface? For instance if I ask "Where is the oldest tree located"? and follow it by "How tall is it?", can we make API responses retain the context?
@Ai_Austin
@Ai_Austin Жыл бұрын
Using 3.5-turbo it is possible to have contextual memory. It would definitely add some complexity and would potentially want to create a command to refresh memory if you did so.
@Gotiplex
@Gotiplex Жыл бұрын
it s very easy, you just have to save your questions and the answers in a .txt file , then you give for each new request the content of the file !
@ronaldoromerovergel8373
@ronaldoromerovergel8373 Жыл бұрын
7:45 can u do that bruh? i would like to put my voice assistent in a webpage for everyone....
@ganugaumeshbabu6144
@ganugaumeshbabu6144 Жыл бұрын
Can we add the basic functionalies like, open a website, open a file, launh/terminate a program ...etc?
@Ai_Austin
@Ai_Austin Жыл бұрын
All are actually pretty simple tasks with Python. Just a matter of adding a wake word for the new task and adding the few lines of code needed for each desired task you mentioned. ChatGPT could probably even do it for you!
@ashrafshaik79
@ashrafshaik79 Жыл бұрын
Hey, nice tutorial, but can you let me know if there is a way to change the voice of the model? If yes, how? Thanks!
@intensanium8320
@intensanium8320 Жыл бұрын
Yes we can modify parameters of the tts. Ask chatgpt how you can modify the parameters of the tts and you will have a little code snippet. just copy and paste the three lines after the initialization, you can modify the values for testing different voices and speech rates
@PdWOLFG4NG
@PdWOLFG4NG Жыл бұрын
this is good bro, thank you
@moshimoshi_04
@moshimoshi_04 Жыл бұрын
Amazing man....
@smert_rashistskiy_pederacii
@smert_rashistskiy_pederacii Жыл бұрын
Can't run it on win11, IDLE 3.11. I get ModuleNotFoundError: No module named 'speech_recognition'
@Ai_Austin
@Ai_Austin Жыл бұрын
Try copying your error message and pasting into ChatGPT. That should give you the answer to whats going on over in your end
@whitneydesignlabs8738
@whitneydesignlabs8738 Жыл бұрын
Great! Good coding,
@RomitKhilari
@RomitKhilari 9 ай бұрын
Great video austin but my program cannot access the voice of the microphone. Do I need to save my voice as a file for this program to recognize
@arjund1173
@arjund1173 Жыл бұрын
cannot access local variable 'audio' where it is not associated with a value (error) How do I fix
@carlyleezpeleta3163
@carlyleezpeleta3163 Жыл бұрын
HIII! I have a question, I am trying to implement this on my visual studio code, I followed every instruction but I’m getting an error in my visual studio terminal “Python not found” I correctly installed it and checked my windows terminal. Could you please help me out?
@Ai_Austin
@Ai_Austin Жыл бұрын
Dont use windows terminal. Its known to cause a ton of issues, because it mimics a linux operating system. Youll need to use command prompt to run the program. Make sure you added python to PATH.
@carlyleezpeleta3163
@carlyleezpeleta3163 Жыл бұрын
@@Ai_Austin Thank you! I am using this for an engineering project and appreciate your help so much. I am having another issue though - in my visual studio terminal it just keeps saying "say Genius to start recording your question..." over and over again until I stop the program, it doesn't say anything back. Do you know why?
@inanis_exe
@inanis_exe Жыл бұрын
How to make voice variations? Or somehow plug there fine tuned tacotron model?
@Ai_Austin
@Ai_Austin Жыл бұрын
Replace the TTS function with the code for the method of your choice. In the next tutorial I will be showing it with the AWS Polly Neural voice. Going to be way way better quality than pyttsx
@inanis_exe
@inanis_exe Жыл бұрын
@@Ai_Austin can’t wait to see
@HealerSama
@HealerSama Жыл бұрын
Hey! How did you make your avatar? Can you make a tutorial on that?
@MrThePavlik
@MrThePavlik Жыл бұрын
kzbin.info/www/bejne/r4i1qWCgeKl1p9E
@Codemons6
@Codemons6 9 ай бұрын
so i'm having a slight problem when i say genius after starting the bot it comes back with: An error occurred: local variable 'filename' referenced before assignment. is there a fix for this?
@ricardocosta9336
@ricardocosta9336 Жыл бұрын
Beultiful pipeline this channel.
@afiatech1807
@afiatech1807 Жыл бұрын
Great video !. what if you add the talking avatar feature to the ai assistant like the one in the video?
@Ai_Austin
@Ai_Austin Жыл бұрын
Thank you 🙏 That is the future but for now the tools to do it would make an extremely expensive and slow assistant
@deussayajin1441
@deussayajin1441 Жыл бұрын
Opa quero ver
@jonathanrose9912
@jonathanrose9912 Жыл бұрын
Currently working on integrating some Amazon API's to make the a much more usable experience. Including no wake word. I have 0 technical background but in the last 6 hours with the help of chatgpt I have a working model
@Ai_Austin
@Ai_Austin Жыл бұрын
My goal exactly with these tutorials is you add your own preferences and upgrade upon these. Super cool to hear you’re doing it with no coding background man!
@Acervorochagames
@Acervorochagames Жыл бұрын
I am creating perfect javas myself.
@GreyWind182
@GreyWind182 Жыл бұрын
Is there anyway to attach this kind of virtual assistant code to a virtual avatar that can respond like the one in this video?
@RichestMen
@RichestMen Жыл бұрын
Is it functional like google? I wanna use it to command a moving equipment or device. Does it support different languages?
@kvvlu
@kvvlu Жыл бұрын
could you redo this for the new GTP 3.5 turbo API?
@ALEXVY91128
@ALEXVY91128 Жыл бұрын
Hey, i have a question, when i run the program i got an error about the openai import: "ModuleNotFoundError: No module named 'openai'". What can i do here?
@Ai_Austin
@Ai_Austin Жыл бұрын
Hey Alexis, have you installed the openai library using pip?
@ALEXVY91128
@ALEXVY91128 Жыл бұрын
@@Ai_Austin I forgot to do that. Thanks New suscriber!
@learningMachine1.0
@learningMachine1.0 Жыл бұрын
Simply, GENIIUS🤣
@saysay1877
@saysay1877 Жыл бұрын
What application do you use
@Liborator88
@Liborator88 Жыл бұрын
Would this work on a Raspberry Pi 4? Also, is there a way to put in a stop listening phrase, for example, "genius, what is the world's tallest building, please," where it stops listening when it hears the word please, lastly can you interrupt it when it's giving a response you don't like or don't want to hear?
@Ai_Austin
@Ai_Austin Жыл бұрын
It will. Definitely requires more technical knowledge than an easier operating system like Mac. You absolutely can achieve both of those goals with some minor python code tweaks. Ask ChatGPT, that will be an easy task for it!
@venkateshan.p1940
@venkateshan.p1940 7 ай бұрын
Bro about api key ,should I buy that or can you enlight me please
@Ai_Austin
@Ai_Austin 7 ай бұрын
Yes if you did not get free usage credit, that means they are not currently doing that in your country. You can start with $5 and that will get you a lot of development usage.
@InspiredInsights4U
@InspiredInsights4U Жыл бұрын
This is a good video it could be even better though with a release of GPT 3.5 turbo if you would take and show this again using GPT 3.5 turbo and whisper I think he would have a lot better response and a lot of people will really jump on wanting to do this. Thanks.
@Ai_Austin
@Ai_Austin Жыл бұрын
I have been researching Whisper. Its barrier to entry is a lot higher. Meaning if you want to run Whisper without having to pay for every question to transcribe, it needs to be done locally. Which puts you in the position of either needing a PC with 10+ GB of video ram. I also have not seen any evidence that the whisper api performs better in transcription than google speech recognition. OpenAI is the hype but I don’t want to make people feel obligated to shell out money for something that is currently possible for free. If one needed offline transcribing and has a beast of a pc to power the python program, Whisper would be a great choice.
@InspiredInsights4U
@InspiredInsights4U Жыл бұрын
I think today’s computers are probably powerful enough to handle text to speech I am a blind individual and I use several apps on my phone on my computer that dude just this kind of conversion and they’re not high power apps or high power computer. Some of them sent off to the Internet for processing but one of the things that could be done. If CPU horsepower is a real concern is push it off to the GPU most computers have Decent graphics processing units that would process much faster than a CPU ever could and it doesn’t take a lot of code to do that. I do think there’s a little more involved in writing code but I don’t think it’s any strong barrier. I think it’s just something Hass to be learn how to do. I’m in the process of trying to learn some of these things myself and I don’t see it as difficult as what you think it might be Again being blind it’s a little hard for me to quickly ramp up to the stuff but I’m getting there
@arjunpatel8181
@arjunpatel8181 Жыл бұрын
Ai Austin, I always run into an error when I debug with pyaudio. I have it installed properly, but its not working.
@arjunpatel8181
@arjunpatel8181 Жыл бұрын
Could not import the PyAudio C module 'pyaudio._portaudio'.
@Ai_Austin
@Ai_Austin Жыл бұрын
Its possible you have multiple versions of python installed and its not linked to the python you are running the program with. Have you tried copying and pasting the whole error code into ChatGPT? (use GPT-4 if you havent tried already)
@GasparGasparQuintero
@GasparGasparQuintero Жыл бұрын
Is some way to change the voice of the asistant? and for use other lenguages?
@Ai_Austin
@Ai_Austin Жыл бұрын
Yes there is. Checkout the newer voice assistant tutorial (my most recent upload)
@ES-uo8to
@ES-uo8to Жыл бұрын
i don't know anything about python but this is fascinating....i would also like to store all my chats realtime in some sort of folder and hierarachy so that i may catalogue them or use them later for referal
@Ai_Austin
@Ai_Austin Жыл бұрын
One idea would be to create an SQL database or a spreadsheet. Then have your python program write that data every time you ask a question. Those type of tasks python is really fast at, so you wont even notice any slowing of your program.
@ES-uo8to
@ES-uo8to Жыл бұрын
@@Ai_Austinthks whom would i commission to do that ...:)
@devawratvidhate9093
@devawratvidhate9093 Жыл бұрын
Which AI tool you use avtar and voice-over Thanks for video
@f4r3l0
@f4r3l0 Жыл бұрын
amazing! congrats
@robbieconnell4301
@robbieconnell4301 Жыл бұрын
Do you have to install the libraries before hand? or will it automatically do it? Complete novice when it comes to coding.
@robbieconnell4301
@robbieconnell4301 Жыл бұрын
Ok I installed them, but now I just get a text response and no audio playback.. dont mind me...
@Ai_Austin
@Ai_Austin Жыл бұрын
Definitely sounds like the pyttsx3 code isnt setup properly. Copy the script, paste it in ChatGPT and ask “why is this code not speaking the response from OpenAI?”
@robbieconnell4301
@robbieconnell4301 Жыл бұрын
@@Ai_Austin Thanks for your response. sorry only just seen
@ishiplop8920
@ishiplop8920 Жыл бұрын
Hey how do i get the actual openai library to show up in the vs code?
@alberttakaruza5612
@alberttakaruza5612 Жыл бұрын
Can some explain were we are headed with but me its gonna be wild 😂 loved the Dis-song😅🎉
@milindkulkarni7101
@milindkulkarni7101 Жыл бұрын
Do we already have an app on Google or apple Play Store that works as a gpt voice assistant and speaks using out of choice Avatar of choice?
@parthkatke976
@parthkatke976 Жыл бұрын
What tool are you using to voice this video 🧐?
@theacend1
@theacend1 Жыл бұрын
elevenlabs
@Yo_whatUP613
@Yo_whatUP613 Жыл бұрын
I’m getting a prompt saying no module named speech recognition, tried pip install speech_recognition but it couldn’t find it, what can I do?
@Ai_Austin
@Ai_Austin Жыл бұрын
You install with 'pip install SpeechRecognition'
@Yo_whatUP613
@Yo_whatUP613 Жыл бұрын
@@Ai_Austin thanks it downloaded
@eschoniger
@eschoniger Жыл бұрын
What do you use to generate your animated AI avatar and voice for that avatar (narrating the video)?
@liambyliam
@liambyliam Жыл бұрын
yeah! how did you code the talking avatar
@skylier
@skylier 8 ай бұрын
In case anyone is wondering as of today (2024) basically everything is outdated in this video unfortunately ;-; Hopefully this can save some people from trying and failing.
@Ai_Austin
@Ai_Austin 8 ай бұрын
Watch the newer tutorials. Code tutorials don't last for ever when they use 3rd party API's! My newer tutorials are far better than this one anyways!
@Daniel.Kovacs
@Daniel.Kovacs 8 ай бұрын
That's me 😂
@8O8DRAX
@8O8DRAX Жыл бұрын
when will these be available for Macs?
@Ai_Austin
@Ai_Austin Жыл бұрын
Its always been. Im developing these on a mac.
@simpled5121
@simpled5121 Жыл бұрын
Big question so I’m trying to run the thing but every time I do it it says that the input.wag it’s not working what can I do?
@Ai_Austin
@Ai_Austin Жыл бұрын
It should be input.wav
@seb_gibbs
@seb_gibbs Жыл бұрын
I've done the same in PHP using a few different APIs and streaming the data as to reduce the latency as much as possible, but its still laggy. Reducing the lag between a question and response is the tricky bit.
@Ai_Austin
@Ai_Austin Жыл бұрын
Id check out my new Bard voice assistant tutorial! Its faster than openai's api's and free. The past week I have been using Bard way more than chatgpt. Its just better for fact based responses that need to check recent internet data to verify its answers. And somehow faster than chatgpt without back-searching google.
@sarkis3791
@sarkis3791 Жыл бұрын
@@Ai_Austin help please it says An error occured: You exceeded your current quota, please check your plan and billing details.
@bipinpandit6000
@bipinpandit6000 3 ай бұрын
I put the API key in the function but the code is not running, please help thanks
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН
VIP ACCESS
00:47
Natan por Aí
Рет қаралды 30 МЛН
AutoGPT Tutorial - More Exciting Than ChatGPT
23:56
Santrel Media
Рет қаралды 760 М.
Create Your Own AI Animated Character (step by step)
15:21
Sharp Startup
Рет қаралды 1,2 МЛН
Creating Jarvis powered by OpenAI and Python | ChatGPT
18:40
CS Coach
Рет қаралды 1 МЛН
Build a LOCAL AI Web Search Assistant with Ollama
26:57
Ai Austin
Рет қаралды 4,2 М.
Python AI Voice Assistant & Agent - Full Tutorial
33:10
Tech With Tim
Рет қаралды 92 М.
Build a Gemini Voice Assistant in Python
23:03
Ai Austin
Рет қаралды 23 М.
Create a GPT4ALL Voice Assistant in 10 minutes
10:33
Ai Austin
Рет қаралды 76 М.
Learn JavaScript in 2025 and Change Your Life (Introduction)
9:08