I Built an A.I. Voice Assistant using PyTorch - part 1, Wake Word Detection

  Рет қаралды 433,272

The AI Hacker

The AI Hacker

Күн бұрын

Пікірлер: 357
@ItachiUchiha-nx2sw
@ItachiUchiha-nx2sw 4 жыл бұрын
First few minutes : Alright, this sounds so cool Middle part: Da fuq Last few minutes : Alright, this sounded cool
@theaihacker777
@theaihacker777 4 жыл бұрын
😂
@christopherdimitrov1652
@christopherdimitrov1652 3 жыл бұрын
Hahahaha
@vinayak354
@vinayak354 3 жыл бұрын
Hahaha
@AfafPrinceOSH
@AfafPrinceOSH 3 жыл бұрын
@@theaihacker777 is there any way u could change the voice of Google assistant to any random voice?
@GeometryDashEndermaster
@GeometryDashEndermaster 4 жыл бұрын
You're like an alternate reality Michael Reeves
@soon3794
@soon3794 4 жыл бұрын
Basically a less suicidal version lol
@dareokoski8158
@dareokoski8158 3 жыл бұрын
@@soon3794 he is still yung, give him time XD
@TeamVexVideos
@TeamVexVideos 3 жыл бұрын
The other Micheal isn't as good with Man's AI. He just uses like google API or existing software. And Micheal Reeves just does comedy not education.
@sgodsell2
@sgodsell2 3 жыл бұрын
At 10:00 you said two words that sound similar, like MOLLY, and FOLLY, FALLEY. If you were to include those 2 or 3 words in your model data under a different category. Then they will never trigger when you say any of those words, except when you say hey wally.
@mohamedfasil4932
@mohamedfasil4932 2 жыл бұрын
The weather is M####F#### hot I thought that and in deep thinking this doesn't sounds right
@user-fj4ih3lo9f
@user-fj4ih3lo9f 4 жыл бұрын
Could you make a more detailed tutorial? I couldn't find any other videos on how to make an AI Voice Assistant, i really liked the vid altough it was sometimes hard to follow. Would really enjoy a full detailed series on this :D
@bryanfeliciano4102
@bryanfeliciano4102 3 жыл бұрын
The best way to learn is to mess with it my dude. Go into the GitHub and read the code, and start writing your own following his example but alter it to suit your tastes .
@johnhandley1870
@johnhandley1870 4 жыл бұрын
I’m watching this video on my iPad and when you said “Hey Siri”, Siri woke up... By the way, I’d be really interesting in seeing a video in which you explain how to set up a computer to carry out Machine Learning tasks. :)
@theaihacker777
@theaihacker777 4 жыл бұрын
Would love to do that!
@JorgeHernandez-iw2fd
@JorgeHernandez-iw2fd 3 жыл бұрын
How does someone even get to this level? I'm assuming years of practice and stuff but what/how did you practice?
@shubhamthapa7586
@shubhamthapa7586 3 жыл бұрын
no one year is enough
@robinranabhat3125
@robinranabhat3125 3 жыл бұрын
While internet is full of AI guru's teaching basics with some slides and a jupyter notebook, this guy actually teaches ML with a production level code. Why are you underrated !!
@p._7555
@p._7555 3 жыл бұрын
True. On the learning curve, we need basic and high level AI teaching too
@dillonridder8737
@dillonridder8737 4 жыл бұрын
"Apple's okay" lol
@schwarzarbyter
@schwarzarbyter 3 жыл бұрын
thats exactly when this video got its 72nd dislike.
@theaihacker777
@theaihacker777 4 жыл бұрын
thanks for the support!
@leninbabu5797
@leninbabu5797 4 жыл бұрын
Its been cool to see U Can we make this model using raspberry Pi zero !!
@maritzadelascasas2727
@maritzadelascasas2727 4 жыл бұрын
the mot ovios wake word is wake up a.i.
@elliotmarks06
@elliotmarks06 Жыл бұрын
This project looks super cool! I'm a little late to the party, but I think this would be awesome to revisit with the new AI chat tools! Especially something like GPT-Neo or the other open-source implementations.
@andreashon
@andreashon 4 жыл бұрын
This video is exactly what I was looking for. All other voice assistant youtube guides use shitty Google services and other proprietary sources. Thank you. Looking forward for next vids on this topic. Also that'd be interesting if you reveal how much time have your machine spent on all that learning.
@theaihacker777
@theaihacker777 4 жыл бұрын
Next video coming up soon! For wakeword, training was really fast. only spent like 30 minutes training.
@leif1075
@leif1075 Жыл бұрын
@@theaihacker777 Was this process mostly fun and enjoyable? If not how did you not give up when it got hard and not get bored and frustrated? Thanks for sharing.
@rangefreewords
@rangefreewords 2 жыл бұрын
Awesome! I was looking for a totally offline LAN based smart A.I. like what you just presented Objective: To control everything on my sailboat, take helm, drop anchor, play music or movies from my pi based server and work alongside my autopilot systems and chart plotter. I know often during my voyages I won't be accessing any internet but, I want to have all the same ubiquitous control as a smart home, etc. I am trying to source as much as I can from KZbin to build a decent system. Keep up the awesome work!
@subhajitkundu7546
@subhajitkundu7546 2 жыл бұрын
Hey, good day mate, The project that you talked about sounds awesome. I am just checking in to know how the project is coming along and where are you headed with this project currently.
@danielogunlolu
@danielogunlolu 2 жыл бұрын
I am working on something similar on channel. Kindly check it out
@mathewpatterson2187
@mathewpatterson2187 Жыл бұрын
Ahoy there! I'm also checking in, how's it going with the project sounds awesome and very reminiscent of what I want to do! Have you had much success?
@daanzap
@daanzap Жыл бұрын
I have the exact same idea! It's going to feel a bit like being on the enterprise.
@justsomeguywithtattoos6267
@justsomeguywithtattoos6267 3 жыл бұрын
This could be easily applied for translations, so that you can have an earplugs that instantly translates what someone is saying to you
@gamecraftczjaajenomja1057
@gamecraftczjaajenomja1057 2 жыл бұрын
Amazing video! Finally someone not just showing some random jupyter notebook. I love how you show the real problems: not enough wake word samples, voice streaming, long training times, etc. Continue if possible, I would greatly appreciate it!
@ramoncaceres4399
@ramoncaceres4399 3 жыл бұрын
Pretty interesting. For about 2 years I’ve been obsessed with turning my house into an AI assistant. Yet have the voice Overlay of ( BT from titanfall ). Yet pretty hard to do that.
@JohnSmith-ox3gy
@JohnSmith-ox3gy 2 жыл бұрын
Why mess with perfection?
@shallinkumar9042
@shallinkumar9042 3 жыл бұрын
I'm getting this error in dataset.py (extension.py", line 14 warnings.warn('torchaudio C++ extension is not available.') )
@leisana4097
@leisana4097 3 жыл бұрын
Extremely extremely intelligent AI. You asked what next to try - Can you try - Self driving car with Raspberry Pi and Pytorch. A small rover
@AshrafAli-hg4zd
@AshrafAli-hg4zd 3 жыл бұрын
Could u please help me with name of ur war Machine
@RichardBaileyrichoncode
@RichardBaileyrichoncode 4 жыл бұрын
Looking forward to next episodes.
@adarshvinayak
@adarshvinayak 4 жыл бұрын
Your videos are amazing man. You've just earned a fan. By the way, I'd love to see you make the speech recognition model next.
@theaihacker777
@theaihacker777 4 жыл бұрын
It seems that speech recognition is popular!
@microgamawave
@microgamawave 2 жыл бұрын
You can make a video about gait recognition biometrics in python recognized you from your walk model ????
@dirtydan69
@dirtydan69 3 жыл бұрын
Where can I talk to you about help on building my own AI? I’m still learning how to code and would mean a lot to me if you gave me a helping hand
@user-tu9ox4hj9g
@user-tu9ox4hj9g 3 жыл бұрын
I would like to hire you for a simple project and will pay substantial for you to implement speech recognition into python for me. Please contact for further detail if we can link to another platform to communicate
@samueldemissie2403
@samueldemissie2403 4 жыл бұрын
imagine running the trainer script on a 4gb ram pc😂😂 i need to buy that pc
@blandcoffeeamv4107
@blandcoffeeamv4107 4 жыл бұрын
I am currently building a bot for my thesis and want to test out everything´s working. i don´t have the parts at home and it´s kinda late to order the parts. i should be able to test it on my laptop too, right?
@cheenamaejafar
@cheenamaejafar 3 жыл бұрын
" and I had to sell my left kidney for this" - Hilarious!
@matthewfelgate
@matthewfelgate 4 жыл бұрын
Wouldn't a trigger word with an X or K make it easier to detect? (Like Alexa or Ok Google)
@theaihacker777
@theaihacker777 4 жыл бұрын
Probably...
@baskett98
@baskett98 4 жыл бұрын
Install an Anaconda and you're set to start. Get more modules and libraries as and when required.
@aaronchantrill7338
@aaronchantrill7338 4 жыл бұрын
I use "MagicVoice" which seems to work well. Does that count as using a 'K'? How are you choosing 'X' and 'K'?
@luvsec5469
@luvsec5469 3 жыл бұрын
The Michael Reeves we always wanted but never got. Until now.
@anshpatel8083
@anshpatel8083 3 жыл бұрын
“ Apple’s okay. “ Im subscribing this channel
@backinyourcommentsectionag3191
@backinyourcommentsectionag3191 3 жыл бұрын
eleven minutes of off-brand Michael reeves speaking greek
@hs4lhp828
@hs4lhp828 3 жыл бұрын
Interesting video. Looks like fun. You're smart af. In other news, I've always suspected that I'm dumb af. This is now confirmed.
@jeremyuzan1169
@jeremyuzan1169 3 жыл бұрын
Hi ! congrats for your work. How do you install Pytorch properly in a Raspberry Pi bro ? Thanks a lot :) Jeremy
@PritishMishra
@PritishMishra 3 жыл бұрын
You are a genius.. Subscribed !!
@MatejMikulas
@MatejMikulas 11 ай бұрын
can i make it in different language?
@ali-g
@ali-g 4 жыл бұрын
Oh man, you are amazing! Just inspired me, thanks for the great content.
@Pablo-wf8dh
@Pablo-wf8dh 2 жыл бұрын
Im in this video and I dont like it
@vulcanviduus252
@vulcanviduus252 4 жыл бұрын
I'm only 48 seconds in... but isn't that a picroft? The pi version of Mycroft
@aaronchantrill7338
@aaronchantrill7338 4 жыл бұрын
PiCroft still uses a remote server to do the heavy lifting. Actually, the last time I checked with Mycroft, they were just passing your voice data through to Google STT Cloud. So if you are using Mycroft, you are still sending random bits of audio to a third party to process, and because they also do the Text to Speech on their remote hardware, both your request and the response are being handled by a third party. This requires quite a bit of trust on your part (it appears that the actual intent parsing is happening locally, but if both the STT and TTS are remote, then what is the benefit?) This is doing all the processing on a raspberry pi. Hardware you control. Also, I think the author's intent is more about showing how this works in a concrete way. He's providing a jumping off point.
@mrCetus
@mrCetus 2 жыл бұрын
Your videos are amazing man. You've just earned a fan. By the way, I'd love to see you make the speech recognition model next.
@lukewhatley8043
@lukewhatley8043 4 жыл бұрын
Awesome video! Do you know if the dataset you decide to train it on has to be in .wav format? So close to getting your example working! Let me know if theres somewhere I can ask some questions regarding your code. Again great video man!
@theaihacker777
@theaihacker777 4 жыл бұрын
Hey Luke! If you have discord, join the discord server and I can help you there. The link is in the description.
@phoenix1799
@phoenix1799 3 ай бұрын
Bro I use a setup with 128GB RAM with RTX 4080 16GB, RTX 3060 OC 12GB, RTX 2060 super 8GB on it with 5TB SSD M2 but I use as a open AIR setup for faster cooling. But you cabinet setup looks very efficient and cool. Could you send me the link for it
@Nitro-Infusions
@Nitro-Infusions 3 жыл бұрын
My name is Michael too
@தமிழோன்
@தமிழோன் 3 жыл бұрын
I wonder why your channel is still not famous. 🤔You deserve millions of subscribers!!!
@AdityaChauhan-mk7lp
@AdityaChauhan-mk7lp Жыл бұрын
Michael reeves from sarojni
@Rottingflare
@Rottingflare 3 жыл бұрын
This looks like an awesome project, can't wait to see more development!
@ZolekaMncwabe
@ZolekaMncwabe 3 жыл бұрын
I'm soo high, I understand everything....but tomorrow I wont🤣🤣🤣🤣🤣😭😭😭😭😭 fuck
@sodapopcowboy8620
@sodapopcowboy8620 3 жыл бұрын
Good news I understand this. Bad News I am overly perfectionistic.
@mashudahmedtalukdar3005
@mashudahmedtalukdar3005 3 жыл бұрын
where is the next part we are waiting. please make more videos about AI Assistant
@DIYRobotGirl
@DIYRobotGirl 11 ай бұрын
But can we make it sing or match voice with pitch.h like we do with the buzzer. How do we get an AI voice to sing.😮
@aperson1181
@aperson1181 9 ай бұрын
Will it support other languages? I am trying to help my elderly mom, who speaks Ukrainian/Russian. I tried to have her speak into a Windows PC in Russian to transcribe this text and then translate it to English. Is there a tool for this? For some reason, Windows does not support Russian speech recognition. Yes, offline is a Big plus.
@harjunmnath
@harjunmnath 3 жыл бұрын
common man, give us the next video of this series we have been waiting too long
@sreeram9220
@sreeram9220 2 жыл бұрын
I'm not gonna lie when I hear your one minute speech I find some hope
@ryandeguara1983
@ryandeguara1983 Жыл бұрын
Hi i tried joining your discord link but it does not let me (maybe server is full?), could I please be given a spot as I would love to join the server as I am looking to complete the project myself. Thanks
@bradc6056
@bradc6056 4 жыл бұрын
I haven’t seen the rest yet, but have you thought of creating a blacklist of all the words that sounds like Wally, and that should increase overall accuracy.
@tuandroidgeneral2091
@tuandroidgeneral2091 4 жыл бұрын
how did you get so fast response
@aryagupta6965
@aryagupta6965 3 жыл бұрын
Part 2?
@devdoctor6351
@devdoctor6351 3 жыл бұрын
Part 2?
@vinijajain2909
@vinijajain2909 2 жыл бұрын
Check out more notes here if you're looking for details on speech AI: www.linkedin.com/feed/update/urn:li:activity:6954297492020613120/
@egs-zs8-127
@egs-zs8-127 4 жыл бұрын
Great video! Thank you so much!
@THEREAL8030
@THEREAL8030 2 жыл бұрын
Hey there . What if we use computer motherboard and good processor, rams and SSD .so after it will work fast or will it even work or not . .
@jenneralkiller7132
@jenneralkiller7132 2 жыл бұрын
I'm getting into Python and machine learning man any chance you could be a good teacher? I'll pay you. I need this from a robotics engineering and I'd rather have a mentor so if you're willing to teach show me where to start I mean I understand I need to learn Python but right now I'm homeless and I'm starting to learn Python at the same time I got screwed over with my financial aid doing my robotics cuz my family was being an asshole the older guy I'm just trying to make my way I got a job going for 25 an hour sucks but something's better than nothing. So maybe if you mentioned me in this I can mentor you and robotics
@1minutechess
@1minutechess 3 жыл бұрын
Hey I am robotic researcher from India how to contact with you please help
@doddianil6946
@doddianil6946 8 ай бұрын
my project is making authorized voice assistant it means the assistant respond only owner of the device commands only ,it not respond others commands by finding his voice. for this my idea is frist store user voice and give any command it compare stored voice with live voice both voices are match it respond otherwise not respond how to make it plz explain 😍
@hussainbhavnagarwala2596
@hussainbhavnagarwala2596 4 ай бұрын
Can we use CNN instead of RNN here for the classification of MFCC images?
@loloxx7460
@loloxx7460 2 жыл бұрын
I just download soft soft. Are you using the paid version or the free tutorial one? Because your screen looked way different than mine.
@iventab2052
@iventab2052 Жыл бұрын
Please make video for live remove speech around you will you listen to people for sample there's a list of music and any one playing on of this music this script will remove this and listen clear speech from person you talking with the music list could be songs or car voice etc...
@AshrafAli-hg4zd
@AshrafAli-hg4zd 3 жыл бұрын
Hi, My name is Ashraf Ali and I am from India could you please help me with my project
@wojtekgame
@wojtekgame 2 жыл бұрын
For now, i don't own an speaker running Python code...
@melaniedavis6379
@melaniedavis6379 2 жыл бұрын
Love this, but you see what had happened was I'm in confusion the words you say are too technical for my brain 😂
@chrisw1462
@chrisw1462 3 жыл бұрын
Sounds like a great project. Too bad you believe profanity adds to the quality of your work. It does not.
@ErgsYT
@ErgsYT 3 жыл бұрын
My boy went ham coding but didn't show us how he got the code in the damn thing 😂😂😂
@saltofearth4902
@saltofearth4902 4 жыл бұрын
Omgurd your awesome.. First five min freaking had me laughing hard!
@Mrprashu99
@Mrprashu99 2 жыл бұрын
nice tuto Saved , tNice tutorials should be a lot Nice tutorialgher. Thank you.
@archlunarwolf
@archlunarwolf 3 ай бұрын
He sticks it to Amazon and then goes ahead and orders the SD card from Amazon...
@goldengun1214
@goldengun1214 Жыл бұрын
Woukd this work if i pprogramed it to sound like either goku or vegeta??? Ive thought about doing this for a big project i want to do
@TuxTechLabs
@TuxTechLabs 3 жыл бұрын
How to connect that with a Chatbot present on Private network computer . Please this will be a fucking new product
@wmacosx
@wmacosx 3 жыл бұрын
Hi, thanks for the great video, can you tell me how much power/cpu this wake-word implementation consumes? I've always wanted to build something like that but I'm worried about the PI having to constantly process audio to catch that wake-word that I would use 2 or 3 times a day. I was thinking that maybe for the wake-word part, some specialized hardware would be preferable, so that the PI will be under load only for a short period of time after each wake-word activation.
@Elian-
@Elian- 4 жыл бұрын
Great video! High qualiy, entertaining and inspiring
@aylinkaradag8487
@aylinkaradag8487 2 жыл бұрын
Anthony Angel first of all, I said no homo. Second, I didn’t understand how that softed gay at that ti and I still don’t understand how it
@harleyehuffinejr2082
@harleyehuffinejr2082 3 жыл бұрын
Did you going to be the next Elon Musk but better stride stay alive you have no choice but the lift your voice this is Ohio Harley e 9
@TheVikingActual
@TheVikingActual 3 жыл бұрын
I just want to make an AI I can talk to, I know it sounds sad It is sad 😔
@simonedebellis6783
@simonedebellis6783 3 жыл бұрын
does exist any way to run it on windows? i get an error where "torchaudio C++ extension is not available"
@payamtorna2
@payamtorna2 2 жыл бұрын
I bet you play video games on your war machine!
@COOLIGANFN
@COOLIGANFN 2 жыл бұрын
Im trying to build a robot and have some data that could be used for a good a.i can you help?
@daynedement2645
@daynedement2645 2 жыл бұрын
Is there a email I contact you I have a disability and would like your help to build one
@NathanaelNewton
@NathanaelNewton 2 жыл бұрын
"Ok, Now that we got the code.." Wow.. that was so information dense.. SUBBED AND BELL WOW.. I'm going to learn a lot here I think :)
@Omkar-ey3ls
@Omkar-ey3ls 4 жыл бұрын
really nice video. may I ask which resources you used to learn code that good in python for neural networks using OOP. i am freshman in computer science and wanted to get good at writing python code for neural networks using OOP. Thanks.
@theaihacker777
@theaihacker777 4 жыл бұрын
Hey! If you look into PyTorch, it’s very Object oriented paradigm to write neural networks
@esimitley4729
@esimitley4729 3 жыл бұрын
Where Practice Weight,? Because It is just for fun .
@arisoda
@arisoda 3 жыл бұрын
HOW LONG DID IT TAKE TO TRAIN ?????????????????????? approximately...?
@aayushbajaj2260
@aayushbajaj2260 3 жыл бұрын
this is insane. thank you for building this.
@sunilshewale913
@sunilshewale913 3 жыл бұрын
can u make google in alexa. like if i ask alexa to search from google website ..
@kalyanstock8058
@kalyanstock8058 Жыл бұрын
Wow...can you do a video on text to speech for any voice?
@chriz__3656
@chriz__3656 4 ай бұрын
is it possible to build this on raspberry pi 3 plezzz reply 😇
@adeelmalik6257
@adeelmalik6257 2 жыл бұрын
Yo does somebody has the source code because in his github it is nowhere to be found!
@fortunateprogrammer7116
@fortunateprogrammer7116 3 жыл бұрын
Personal AI assistant using python "JARVIS" : kzbin.info/www/bejne/anicdnidrL98b7M
@obviousabsurdity3181
@obviousabsurdity3181 9 ай бұрын
dont waste your time going to this link, he doesnt show any code, just a demo of his home version of jarvis :(
@user-mr8cw4ud8n
@user-mr8cw4ud8n 2 жыл бұрын
He had when he "pitched down the Nice tutorialgh hats at the end of the phrase. "
@askminrui
@askminrui Жыл бұрын
You dont need all these just for a wake up word detection.
@helloqasim
@helloqasim 3 жыл бұрын
Next time make the video using your voice assistant
@HappiFix
@HappiFix 3 жыл бұрын
daaaaayum good stuff homie g gangsta
@jojo-gg1iz
@jojo-gg1iz Жыл бұрын
Damn he totally abandoned this project
@thom2503
@thom2503 4 жыл бұрын
Great video, liking the format
@seesah
@seesah 4 жыл бұрын
this is so amazing!
I Built a Personal Speech Recognition System for my AI Assistant
16:32
Players vs Corner Flags 🤯
00:28
LE FOOT EN VIDÉO
Рет қаралды 76 МЛН
Inside Out 2: ENVY & DISGUST STOLE JOY's DRINKS!!
00:32
AnythingAlexia
Рет қаралды 13 МЛН
Every Developer Needs a Raspberry Pi
27:27
Sam Meech-Ward
Рет қаралды 627 М.
But what is a neural network? | Chapter 1, Deep learning
18:40
3Blue1Brown
Рет қаралды 17 МЛН
I Built a CoPilot+ AI PC (without Windows)
12:50
Jeff Geerling
Рет қаралды 381 М.
Raspberry Pi AI: Picroft Voice Assistant
21:34
ExplainingComputers
Рет қаралды 165 М.
Voice Recognition Raspberry PI and Arduino UART communication
15:17
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 275 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,5 МЛН