I Built an A.I. Voice Assistant using PyTorch - part 1, Wake Word Detection

Views: 426,054

The AI Hacker


This is a series where I walk through the engineering steps and challenges of building an artificial intelligence voice assistant, similar to Google Home or Amazon Alexa, with Python and PyTorch on a Raspberry Pi. I leverage the latest machine learning and deep learning techniques to achieve this.
In this video, I show how you can build a wake word detector (keyword spotting) using recurrent neural networks, specifically LSTMs.
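To make the LSTM idea concrete, here is a minimal, framework-free sketch of how a recurrent wake word detector consumes a sequence of audio feature frames and emits one probability. It is not the code from the video or the linked repository (which use PyTorch): the function names, the layer sizes, and the random, untrained weights are all assumptions chosen purely for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h, c, W, b):
    """Run one LSTM time step.

    x: feature frame (list of floats); h, c: previous hidden and cell state.
    W[gate] is a [hidden][len(x) + hidden] matrix, b[gate] a bias list.
    """
    z = x + h  # concatenate the input frame with the previous hidden state

    def gate(name, activation):
        return [activation(sum(w * v for w, v in zip(row, z)) + bias)
                for row, bias in zip(W[name], b[name])]

    i = gate("i", sigmoid)    # input gate: how much new information to write
    f = gate("f", sigmoid)    # forget gate: how much old cell state to keep
    o = gate("o", sigmoid)    # output gate: how much cell state to expose
    g = gate("g", math.tanh)  # candidate values for the cell state
    c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
    return h_new, c_new


def wake_word_probability(frames, W, b, w_out, b_out):
    """Feed every frame through the LSTM, then classify the final hidden state."""
    hidden = len(w_out)
    h, c = [0.0] * hidden, [0.0] * hidden
    for x in frames:
        h, c = lstm_step(x, h, c, W, b)
    logit = sum(w * v for w, v in zip(w_out, h)) + b_out
    return sigmoid(logit)


def random_params(n_in, hidden, seed=0):
    """Hypothetical untrained weights, only so the sketch can run."""
    rng = random.Random(seed)
    W = {k: [[rng.uniform(-0.5, 0.5) for _ in range(n_in + hidden)]
             for _ in range(hidden)] for k in "ifog"}
    b = {k: [0.0] * hidden for k in "ifog"}
    w_out = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]
    return W, b, w_out, 0.0


# Three made-up feature frames with three values each.
W, b, w_out, b_out = random_params(n_in=3, hidden=4)
frames = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.4], [0.2, 0.0, -0.3]]
print(wake_word_probability(frames, W, b, w_out, b_out))
```

With untrained weights the printed probability is meaningless; in a real detector the weights would be learned from labeled wake word and non-wake-word clips, and the frames would be audio features rather than hand-typed numbers.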
⭐ Play and Experiment With the Latest AI Technologies at grandline.ai ⭐
GitHub:
github.com/LearnedVector/A-Ha...
Parts:
Raspberry Pi 4 Model B - www.amazon.com/gp/product/B07...
ReSpeaker 2 mic array HAT - www.amazon.com/gp/product/B07...
Portable mini speaker - www.amazon.com/gp/product/B07...
Micro SD - www.amazon.com/gp/product/B07...

Comments: 357
@ItachiUchiha-nx2sw 4 years ago
First few minutes : Alright, this sounds so cool Middle part: Da fuq Last few minutes : Alright, this sounded cool
@theaihacker777 4 years ago
😂
@christopherdimitrov1652 3 years ago
Hahahaha
@vinayak354 2 years ago
Hahaha
@AfafPrinceOSH 2 years ago
@theaihacker777 is there any way u could change the voice of Google assistant to any random voice?
@robinranabhat3125 3 years ago
While the internet is full of AI gurus teaching basics with some slides and a Jupyter notebook, this guy actually teaches ML with production-level code. Why are you underrated!!
@p._7555 3 years ago
True. On the learning curve, we need basic and high level AI teaching too
@gamecraftczjaajenomja1057 1 year ago
Amazing video! Finally someone not just showing some random jupyter notebook. I love how you show the real problems: not enough wake word samples, voice streaming, long training times, etc. Continue if possible, I would greatly appreciate it!
@user-fj4ih3lo9f 3 years ago
Could you make a more detailed tutorial? I couldn't find any other videos on how to make an AI voice assistant. I really liked the vid, although it was sometimes hard to follow. Would really enjoy a full detailed series on this :D
@bryanfeliciano4102 3 years ago
The best way to learn is to mess with it my dude. Go into the GitHub and read the code, and start writing your own following his example but alter it to suit your tastes.
@GeometryDashEndermaster 4 years ago
You're like an alternate reality Michael Reeves
@soon3794 3 years ago
Basically a less suicidal version lol
@dareokoski8158 3 years ago
@soon3794 he is still young, give him time XD
@TeamVexVideos 3 years ago
The other Michael isn't as good with Man's AI. He just uses like Google API or existing software. And Michael Reeves just does comedy not education.
@sgodsell2 2 years ago
At 10:00 you said a few words that sound similar, like MOLLY, FOLLY and FALLEY. If you were to include those 2 or 3 words in your model data under a different category, then they will never trigger when you say any of those words, only when you say hey wally.
@mohamedfasil4932 2 years ago
The weather is M####F#### hot I thought that and in deep thinking this doesn't sounds right
@Rottingflare 3 years ago
This looks like an awesome project, can't wait to see more development!
@Elian- 4 years ago
Great video! High quality, entertaining and inspiring
@mrCetus 2 years ago
Your videos are amazing man. You've just earned a fan. By the way, I'd love to see you make the speech recognition model next.
@mileshall5795 1 year ago
DUDE!!!!....You are a total BOSS!!! Thanx man. you are WAAYY better at this instructional thing than established channels/YouTubers that (for some reason) have more subscribers, etc. Keep that stuff UP!! You are just the dude I was looking for.
@RichardBaileyrichoncode 4 years ago
Looking forward to next episodes.
@elliotmarks06 1 year ago
This project looks super cool! I'm a little late to the party, but I think this would be awesome to revisit with the new AI chat tools! Especially something like GPT-Neo or the other open-source implementations.
@ali-g 4 years ago
Oh man, you are amazing! Just inspired me, thanks for the great content.
@johnhandley1870 4 years ago
I’m watching this video on my iPad and when you said “Hey Siri”, Siri woke up... By the way, I’d be really interested in seeing a video in which you explain how to set up a computer to carry out Machine Learning tasks. :)
@theaihacker777 4 years ago
Would love to do that!
@fahdciwan8709 4 years ago
Keep it coming Michael !!! Thanks a ton!!
@aayushbajaj2260 2 years ago
this is insane. thank you for building this.
@PrashantKumar-do5fk 1 year ago
Been watching your vids for a good few weeks now, learning new shit each day. My workflow has improved so much since watching
@prakashupadhyay9529 4 years ago
Loved your explanation and flow!
@thom2503 4 years ago
Great video, liking the format
@theaihacker777 4 years ago
thanks for the support!
@leninbabu5797 3 years ago
It's been cool to see you. Can we make this model using a Raspberry Pi Zero?
@maritzadelascasas2727 3 years ago
The most obvious wake word is "wake up A.I."
@etienneekpo348 4 years ago
Cool Mike. Thanks for sharing !
@leisana4097 3 years ago
Extremely extremely intelligent AI. You asked what next to try - Can you try - Self driving car with Raspberry Pi and Pytorch. A small rover
@developerpranav 1 year ago
This is super cool! You explained it very well, really appreciate it! :D Thanks!
@cloudsystem3740 1 year ago
Honestly, I never used anything like that, but somehow you inspire me to start thinking how I can learn that stuff. Very nice video and source code, thanks
@seesah 4 years ago
this is so amazing!
@fteoOpty64 3 years ago
Ultra Cool Dude!. Fantastic instruction process and very concise. Very Good. 101% grading!.
@andrewsimon6058 4 years ago
This video is exactly what I was looking for. All other voice assistant YouTube guides use shitty Google services and other proprietary sources. Thank you. Looking forward to the next vids on this topic. Also, it'd be interesting if you revealed how much time your machine spent on all that learning.
@theaihacker777 4 years ago
Next video coming up soon! For wakeword, training was really fast. only spent like 30 minutes training.
@leif1075 1 year ago
@theaihacker777 Was this process mostly fun and enjoyable? If not, how did you not give up when it got hard and not get bored and frustrated? Thanks for sharing.
@rangefreewords 2 years ago
Awesome! I was looking for a totally offline LAN based smart A.I. like what you just presented. Objective: To control everything on my sailboat, take helm, drop anchor, play music or movies from my Pi based server and work alongside my autopilot systems and chart plotter. I know often during my voyages I won't be accessing any internet, but I want to have all the same ubiquitous control as a smart home, etc. I am trying to source as much as I can from YouTube to build a decent system. Keep up the awesome work!
@subhajitkundu7546 2 years ago
Hey, good day mate, The project that you talked about sounds awesome. I am just checking in to know how the project is coming along and where are you headed with this project currently.
@danielogunlolu 1 year ago
I am working on something similar on channel. Kindly check it out
@mathewpatterson2187 1 year ago
Ahoy there! I'm also checking in, how's it going with the project sounds awesome and very reminiscent of what I want to do! Have you had much success?
@daanzap 11 months ago
I have the exact same idea! It's going to feel a bit like being on the enterprise.
@xYASMINNN 3 years ago
Love this! 🤍
@kellbooby265 3 years ago
Love u bro, I didn't find this information even on Google... thanks a lot.
@antoniomeraz520 3 years ago
Dude you're awesome!!! Make more of this
@anshpatel8083 3 years ago
“Apple’s okay.” I'm subscribing to this channel
@egs-zs8-127 3 years ago
Great video! Thank you so much!
@manuelherrerahipnotista8586 2 years ago
Very clear explanation. Thanks a lot
@mingtang9823 3 years ago
This is very helpful! Thanks a lot. I subscribed!
@r7rahuls 3 years ago
Finally.....I got what I was looking for. ❤️
@400DaysUnicorn 3 years ago
Couldn't find part 2 and so on. Would love to do this project for myself. Thank you!
@getalife6654 3 years ago
I really like your content and wanted to try this myself :)
@PritishMishra 3 years ago
You are a genius.. Subscribed !!
@amitg2k 2 years ago
Awesome...way to go
@தமிழோன் 3 years ago
I wonder why your channel is still not famous. 🤔You deserve millions of subscribers!!!
@justsomeguywithtattoos6267 3 years ago
This could be easily applied for translations, so that you can have earplugs that instantly translate what someone is saying to you
@mauricioandrestiznadoroman3460 1 year ago
production. Thanks again!
@janardannn 4 years ago
Amazing dude :O
@NathanaelNewton 1 year ago
"Ok, Now that we got the code.." Wow.. that was so information dense.. SUBBED AND BELL WOW.. I'm going to learn a lot here I think :)
@mcudgir1291 1 year ago
This helped a lot thank you
@dillonridder8737 4 years ago
"Apple's okay" lol
@schwarzarbyter 3 years ago
thats exactly when this video got its 72nd dislike.
@HappiFix 3 years ago
daaaaayum good stuff homie g gangsta
@romicasimiaisialtele366 1 year ago
Thank you Mike! I'm just starting out and this video really helped get the basics down!
@ramoncaceres4399 3 years ago
Pretty interesting. For about 2 years I’ve been obsessed with turning my house into an AI assistant. Yet have the voice Overlay of ( BT from titanfall ). Yet pretty hard to do that.
@JohnSmith-ox3gy 2 years ago
Why mess with perfection?
@ShivamVerma-gq2sm 4 years ago
Why not go in the sequence you already told? Incredible video! I read a blog of yours on Medium about LSTM, quite a good explanation. Thanks man for such awesome stuff.
@theaihacker777 4 years ago
Always a logical choice! but just wanted to know what the viewers are most interested in seeing next
@RomansapienMVision 2 years ago
Thanks for sharing. I was looking for a video that would get my head spinning then eventually put me to sleep.
@DJWangDJ 1 year ago
dude, you are awesome!
@alexzab7653 3 years ago
It's amazing!!!
@akashdhage 3 years ago
Great learning, thanks a lot. I have gone through the video completely. If the audio signal is split at equal intervals, e.g. every 2 seconds, then it may result in a loss of information, as the split may occur in the middle of a word.
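Editor's note on the point above: a common way to reduce the risk of cutting a word in half is to let consecutive windows overlap, so that a word split by one window boundary falls wholly inside a neighboring window. Whether the code in the video does this is not stated; the sketch below is a minimal illustration, and the function name `sliding_windows` and its parameters are hypothetical.

```python
def sliding_windows(samples, window, hop):
    """Yield fixed-length windows over `samples`; they overlap when hop < window.

    If the clip is shorter than `window`, a single shorter window is yielded.
    Trailing samples that do not fill a whole window are dropped.
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    last_start = max(len(samples) - window, 0)
    for start in range(0, last_start + 1, hop):
        yield samples[start:start + window]


clip = list(range(10))  # stand-in for audio samples

# Non-overlapping split: anything spanning samples 3..5 is cut in two.
print(list(sliding_windows(clip, window=4, hop=4)))

# 50% overlap: the window starting at sample 2 contains samples 3..5 whole.
print(list(sliding_windows(clip, window=4, hop=2)))
```

The trade-off is more windows to classify per second of audio, which matters on a small device such as a Raspberry Pi.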
@applabstudio3155 3 years ago
Thank you very much❤️❤️❤️❤️please continue
@CleffnaTV 1 year ago
Amazing!!!
@DrSmart20 1 year ago
haha i looked the idea up like a year ago thinking it would be cool and everything i found was just "attach spare phone to a speaker" lol this just showing up in my feed now is so exciting
@vanmanhle5642 3 years ago
That's really great
@tuananhlam90 4 years ago
Hey great tutorial! Looking forward to your written guide! But question/request for you: if I don't have access to expensive deep learning hardware setup like you have here, can you do another separate tutorial series on how to build/train model on say AWS/GCP?
@theaihacker777 4 years ago
Totally can consider that. But I think there are quite a few tutorials like that out there. Also, I would recommend using Google Colab since it’s free and good enough for small projects
@mbp5863 2 years ago
You make some pretty damn good content!:) When is the next video coming?
@marwinsteiner5817 3 years ago
Hey you, I have the same computer case like you! Interesting video!
@lalitbhargava3993 3 years ago
Very informative
@sreeram9220 2 years ago
I'm not gonna lie when I hear your one minute speech I find some hope
@scollyb 3 years ago
Great video. Have you tried the simple solution of adding examples of you saying the close phrases to the training set? The simplest way would be just to add many copies of them to the set and retrain. A possibly more robust way would be to add a second stage to the process trained on only your voice.
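Editor's note on the suggestion above: "adding many copies" of the near-miss phrases is a simple form of oversampling hard negatives. The sketch below shows the idea on a list of `(clip_path, label)` pairs. The function name, the file names, and the label convention (1 for the wake word, 0 for anything else) are hypothetical and are not taken from the video's repository.

```python
import random


def oversample_hard_negatives(dataset, hard_negatives, copies=5, seed=0):
    """Return a shuffled training list with each hard negative repeated `copies` times.

    dataset and hard_negatives are lists of (clip_path, label) pairs.
    The input lists are not modified.
    """
    augmented = list(dataset)
    for path, label in hard_negatives:
        augmented.extend([(path, label)] * copies)
    random.Random(seed).shuffle(augmented)  # fixed seed for repeatable runs
    return augmented


# Hypothetical clips: one wake word sample, one background sample,
# and two recordings of similar-sounding words labeled as negatives.
base = [("hey_wally_01.wav", 1), ("background_01.wav", 0)]
near_misses = [("molly_01.wav", 0), ("folly_01.wav", 0)]

train = oversample_hard_negatives(base, near_misses, copies=3)
print(len(train), "training examples")
```

Repeating identical clips mainly increases their weight in the loss; recording several distinct takes of each near-miss phrase would likely generalize better, but that is a judgment call rather than something the source confirms.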
@lukewhatley8043 4 years ago
Awesome video! Do you know if the dataset you decide to train it on has to be in .wav format? So close to getting your example working! Let me know if there's somewhere I can ask some questions regarding your code. Again, great video man!
@theaihacker777 4 years ago
Hey Luke! If you have discord, join the discord server and I can help you there. The link is in the description.
@dheerajrishimusic 3 years ago
this is very nice
@saltofearth4902 3 years ago
Omgurd your awesome.. First five min freaking had me laughing hard!
@mitchellstewart6745 9 months ago
It’s a shame you didn’t keep this repo up I just got a raspberry pi and this is my intentions but I don’t know enough about ML or engineering to pull it off 😅 thanks for the content!
@UmbraAtrox_ 2 years ago
THANK YOU!
@hs4lhp828 3 years ago
Interesting video. Looks like fun. You're smart af. In other news, I've always suspected that I'm dumb af. This is now confirmed.
@halimaujunwa9533 3 years ago
Oh yhhh......I got lost in that mathematical modelling part tho but still cool.....really cool
@wearesisterstube7354 3 years ago
Good job
@myrthestruver5262 1 year ago
you have no idea how proud I am I even understood half of this haha
@benniegant 3 years ago
Wow, My AI Assistant is working now Thanks!!!
@lionellow105 1 year ago
Hey sorry for bothering but would it be okay if I ask you some questions on how you got it to work? Especially the raspberry Pi part
@harjunmnath 2 years ago
Come on man, give us the next video of this series, we have been waiting too long
@bradc6056 3 years ago
I haven’t seen the rest yet, but have you thought of creating a blacklist of all the words that sound like Wally? That should increase overall accuracy.
@onlyyou200548 2 years ago
good job.
@RichardBaileyrichoncode 4 years ago
Fun project.
@sunilshewale913 3 years ago
awesome
@GalacticTechTrails 2 years ago
random shit!! Love it. Your the man!
@benni5541 2 years ago
I'm currently using Rhasspy. Although it's pretty neat and does EVERYTHING, it does not leave much room for the very tech-savvy users. I guess I will tinker a bit with your code and incorporate parts. But I need to replicate my current satellite nodes with printed PCBs first. So much to do :D
@JorgeHernandez-iw2fd 3 years ago
How does someone even get to this level? I'm assuming years of practice and stuff but what/how did you practice?
@shubhamthapa7586 2 years ago
No, one year is enough
@PiyushSharma-od2el 3 years ago
Nice making Terminator A.i
@rupertbowen-jones858 3 years ago
Have you considered the NVIDIA Jetson Nano or the Jetson Xavier NX? Could be a better and more powerful solution than a humble pi? Looking forward to working through this project though... great videos and git. Happy New Year!
@szczurekk1155 1 year ago
I also really like softEX, it has a very nice effect to it
@robertsutkowski3170 11 months ago
Nice
@MrBobWareham 3 years ago
How did you learn so much and know what to type? It just looks so complicated, you are awesome
@TheAcujlGamer 3 years ago
Did I just find a small creator that makes great & fun PyTorch content?
@ObitoUchiha-be1jo 3 years ago
I was thinking of building too using raspberry xD. I'm surprised u didn't include cortana
@VonchkynProduction 3 years ago
Do you have more projects coming up? it'd be cool to see some more projects or tutorials
@johnsnow5510 2 years ago
Very nice idea! It may be a dumb question, but here it goes: can the recorded voice be in a language other than English, and by using the same principle get similar results? I'd like to create an assistant that recognizes speech input in real time and returns information like weather, YouTube videos etc.
@randallnorwood6803 1 year ago
Well I must say that I'm very interested
@dr.mikeybee 2 years ago
Nicely done. You need to add a bunch of Molly Dolly ground truth to the training set. That should fix it. It's great that you have it running on a Pi. Nevertheless, you are going to need speech recognition, and I'm not sure Vosk or DeepSpeech will run on a Pi, and if it does, you really don't need a wake word detector model. If the speech recognition model can understand the wake word, that can be your detector model too. The only reason for a wake word detector is to avoid going out to the cloud.
@andressastre6298 4 years ago
do the NLU next!!
@karanrathod8555 3 years ago
sir please complete full course of this AI soon...
@provakar5496 3 years ago
Thanks a lot, I just made google assistant!
@lionellow105 1 year ago
Hey, what wakeword did you use
@wmacosx 3 years ago
Hi, thanks for the great video. Can you tell me how much power/CPU this wake word implementation consumes? I've always wanted to build something like that, but I'm worried about the Pi having to constantly process audio to catch a wake word that I would use 2 or 3 times a day. I was thinking that maybe for the wake word part, some specialized hardware would be preferable, so that the Pi will be under load only for a short period of time after each wake word activation.
@shivansharma1310 3 years ago
Nice bro, also which voice did you use in the assistant? Or did you just create your own?
@badcamerax 3 years ago
this guy is genius
@sunilshewale913 3 years ago
liked and subscribed