I Built a Personal Speech Recognition System for my AI Assistant

  Views: 256,471

The AI Hacker


This video shows you how to build your own real-time speech recognition system with Python and PyTorch. It walks you through the deep learning techniques that are effective for modeling speech, as well as the code to build your own.
⭐ Play and Experiment With the Latest AI Technologies at grandline.ai ⭐
This video is the second episode of the series "How to build your own A.I. voice assistant with Pytorch"
• Build an AI Voice Assi...
Github:
github.com/LearnedVector/A-Ha...
Pre-Trained ASR Model:
drive.google.com/file/d/1jcNO...
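
As a rough illustration of one decoding step a system like this might need, here is a minimal sketch in plain Python. It assumes the acoustic model outputs one score per character for every audio frame and is trained in the CTC style; that is an assumption, since the description above does not say how the model is trained or decoded. The label set, `BLANK_ID`, and the helper names `argmax`, `one_hot`, and `greedy_ctc_decode` are hypothetical and are not taken from the linked repository.

```python
import string
from itertools import groupby

# Hypothetical label set: index 0 is the CTC blank, then space, then a-z.
LABELS = ["<blank>", " "] + list(string.ascii_lowercase)
BLANK_ID = 0


def argmax(scores):
    """Return the index of the largest score (first index wins on ties)."""
    return max(range(len(scores)), key=scores.__getitem__)


def one_hot(label_id):
    """Build a fake per-frame score vector that strongly prefers one label."""
    return [1.0 if i == label_id else 0.0 for i in range(len(LABELS))]


def greedy_ctc_decode(frame_scores):
    """Greedy CTC decoding: best label per frame, collapse repeats, drop blanks."""
    best_ids = [argmax(frame) for frame in frame_scores]
    # Consecutive duplicates are merged first, so a blank between two
    # identical letters is what keeps a double letter such as "ll".
    collapsed = [label_id for label_id, _ in groupby(best_ids)]
    return "".join(LABELS[i] for i in collapsed if i != BLANK_ID)
```

For example, frames whose best labels are `h, h, <blank>, i, i` collapse to `h, <blank>, i` and decode to `hi`. A real-time system would more likely replace the greedy step with a beam search and a language model, but the collapse-then-drop-blanks rule stays the same.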

Comments: 301
@totoma3297 3 years ago
this is michael reeves from the universe where he decided to do something useful with his life
@isawcornflakes6201 3 years ago
LMFAOOOO DIDNT HAVE TO DO HIM LIKE THAT 😭☝️
@aliveandwellinisrael2507 2 years ago
6:57 yep
@UmbraAtrox_ 2 years ago
Wow, that's mean bro.
@justinross2664 2 years ago
kzbin.info/www/bejne/n6rPZmawrt9osM0
@DayoBrandon 2 years ago
Imagine the greatest Michael colab. The two of them plus Michael Stevens (vsauce)
@joeyrivenbark5056 2 years ago
Hey man, I really like how you have written definitions in addition to your speaking, helps a lot.
@akulgoel9259 1 year ago
This is so good, I remember seeing this video a year ago and wishing he'd continued the series.
@zacknawrocki 3 years ago
I've been looking forward to this part of the series the most! I've been trying to create/run a voice assistant locally, and could not figure out how to apply speech recognition without relying on Google's Python module (which i was trying to avoid for privacy reasons, defeating the purpose of making one) and the HMM basics in my Intro to AI course weren't enough to implement it. This is fantastic.
@OtRatsaphong 2 years ago
Wow, just discovered your channel. Great work. I'm just starting my journey into Deep learning and speech recognition. Will be following your progress.
@victor7ultimate 3 years ago
After watching this video, I literally took off my hat as a mark of respect to this. Cant thank you enough. Thanks a million
@fteoOpty64 3 years ago
Loved the high speed speech part!. Well done. Excellent production Mike!. TQ
@swarajshinde3950 3 years ago
Loved it Man , Great Video !
@CraftClone1 3 years ago
This is awesome! I wish there was more content from you One Ai hacker to another, keep on going!
@chenjus 3 years ago
Really dope video. Can't wait to see your next one.
@theroyal1914 3 years ago
we need programmers like you. For advance learning.
@davidkim2389 3 years ago
When next?? Best Series ever!! Please post next!!
@alexkonopatski429 3 years ago
this series is so cool! keep it up bro
@briankim49 3 years ago
Loved the video. You really showed me the tools I could use to build my own speech recognition model!
@JoshuaHerath 3 years ago
This video is so high quality wish you uploaded more
@Alex.In_Wonderland 1 year ago
omg, thank you! every other video I look up on this subject is just an ad for a text-speech readers! thanks for going into such detail about your thought process, buut after looking at the rig you have vs the one I've got ... well. . . if it took you a handful of days, it'd take me a week or two LOL great video! thanks a lot!
@neilosborne8682 2 years ago
This is excellent! (subscribed!) I had to quickly brush up my skills for a project I'm working on (will be open sourcing it soon!) - and this video was short, sweet and to the point! Thanks
@gauravshipurkar1570 2 years ago
Bro you are freaking awesome!!! i love your content, helps a lot.
@sirlightshadowslayer473 7 months ago
This was insane, gonna try to do similar now, thank you for the informations
@mtaneesh1411 3 years ago
This was a really good video dude. Can you tell me how to make the soundwave display that you had while testing the model
@sreerajsathish3635 3 years ago
Omg the video i was looking for thank for making one..... Full support❤
@smeagol92055 3 years ago
I'm building my own wearable AI assistant and this series is **exactly** what I was looking for! Great stuff!
@strange5700 3 years ago
Can you make tutorial
@PonchoManOG 1 year ago
dude no way same
@morraza3307 1 year ago
@@PonchoManOG does this tutorial still work?
@PonchoManOG 1 year ago
@@morraza3307 yes
@thiscrow 3 years ago
at the beginning of the video: Oh I see ! 6:57 : Oh I ... oh ...
@chrisw1462 3 years ago
A Cue Stick - used for playing billiards. Acoustic (a-COO-stick) - dealing with sound or audio energy.
@vicehaiti914 3 years ago
Keep going bro.full support
@rahulkumarm1446 3 years ago
Brooo.I really dont know whether u coded this or just took reference from something....idrc u are AMMMMMAAAAZZZZIIINGGGGG.Hats off 2 u.U have a great talent man.......u could be the next ceo of any big fours too....
@itumelengmothapo2456 3 years ago
thank you man... this was fun to watch
@kalyanstock8058 1 year ago
Wow...who knew you can make AI teaching so much fun....You should make more videos
@SivaShankarsss 3 years ago
Eagerly waiting
@jairojosy5985 3 years ago
Keep going on and finish the project fast. I'm looking ahead for the project to be finished
@user-jj8qh5lm7u 2 years ago
i think this is a very good video for me ,It can not only let me learn some knowledge, but also make me feel relaxed.thank you
@jumbejolly3129 2 years ago
Man your a genius man. I wish I could do this. I have some many ideas but dont know where to start.
@fteoOpty64 3 years ago
Love your War Machine!. I build my first Pentium Pro Dual Proc decades ago. It had a special powersupply and I had to rig my Generic case to fit the Tyan motherboard!. It ran Linux then.
@kevinrtres 2 years ago
Thanks for the information. Just goes to show that the idea that we evolved is just sheer madness.
@michealhall7776 3 years ago
I'm enjoying discovering all these smaller ai channels
@alexandergrayson9856 3 years ago
Hey pal, your work's great I love it 🙌🙌
@hemanth8195 3 years ago
This is really nice work dude
@kimkubik7547 2 years ago
You Michael Rock!!!! Way to teach!!!
@zikpin 3 years ago
This is what i was looking for, thanks
@jtlunsford780 1 year ago
Totally awesome. Understood about .5% (that's point 5%). Just got my headset set up in Win 10 and am loving it. You're awesome and I bow to your knowledge and expertise....thanks for the cool vid. It was not wasted on my limited knowledge, but it peaked my interest...thanks again...JT
@tripathi26 3 years ago
This is awesome! thanks man.
@CreateYourWorld1 3 years ago
Planning on creating my own Jarvis, this video has given me an insight.
@s1krrpilot 2 years ago
Same, I'm going to call mind Alfred and integrate it into my helmet
@yashrajhawle4 3 years ago
Thank you for sharing your knowledge !
@UttamDas-ub5ow 3 years ago
This man is really a hero 👍💓
@adeniyiadeboye3300 3 years ago
Thanks for this..I am going to thoroughly go through the speech recognition your code on Github
@nathancook8452 2 years ago
Excellent video, you helped me out tremendously
@w3w3w3 1 year ago
your videos are great bro! 🤝
@redtako. 7 months ago
THIS ONE WAS REALLY FUNNY gj love keep up the uploads :)
@diegomartin6332 3 years ago
Please post more videos about this!
@JasonTRogers 2 years ago
Hey Michele, your videos on AI is fantastic! I haven’t seen any videos lately and I am course what you are doing these days?
@seannam1218 3 years ago
This is incredibly educational. Thx for sharing ur knowledge for free!
@soonapaana24 3 years ago
You are totally awesome bro...👏👏👏
@angelgabrielortiz-rodrigue2937 3 years ago
Wao, great video man. Really awesome stuff
@shannonsteward4034 1 year ago
hi great work I just found your channel great job
@DavidAlvesWeb 3 years ago
what a great video man, really inspiring! keep up the good work! PS: you deserve a better t-shirt bro 😅
@muhammadrezahaghiri 3 years ago
Can you make a TTS using deep learning? :) I really want to see that.
@benceelmokovacs1422 2 years ago
Wo hooo! This thing for FREE?! And help for us how to make it ours?! This data worth a HUGE amount of money, but you shared it! I'm so much surprised, in the good term! Thanks, thanks, thanks for it!! I really want to make an own Virtual Assistant, so big thanks for this video, for the data and for the help! Be blessed!
@PritishMishra 3 years ago
Why aren't you uploading more videos? I have already seen this video just came here to say... plzz upload it's been 7 months now!
@tilahunanagaw6175 1 year ago
what interesting presentation it is!!!
@hg4lyfe 3 years ago
Bro this is perfect wow thanks
@stereopsych6381 1 year ago
Please upload more!
@vladiklass1890 3 years ago
Cool video!!! This will help me a lot with my first NLP project. I wanted to get radio voice data and transcribe it. Any tips on that? Btw you should come up with a more memorable outro! :D
@rodios-md5du 1 year ago
You are gold💛
@aviavinav7208 3 years ago
Great Video!
@SpeechProductivity 3 years ago
Very informative!
@superaluis 3 years ago
Awesome content!
@emrehankaraoglu4122 2 years ago
This is such a amazing video. Congrats! I am wondering about model deployment part. Are you going to share the coding part of web interface? The sound wave and the text that occurs below the sound wave are awesome.
@adibakhan2865 10 months ago
Hey did you found the code for deployment
@kadaliakshay6770 4 months ago
bro amazing wrapping on 6:54
@danieleangelini6238 3 years ago
Fantastic Bro 💞
@DasToastbrotToast 3 years ago
Which hardware accelerator are you going to use, if any? As far as I know the net would be quite slow on the Pi itself hence a hardware accelerator like the Intel NCS or Google Coral would be useful, wouldn't it?
@microgamawave 2 years ago
You can make a video about gait recognition biometrics in python recognized you from your walk model
@deelordthegreat 1 year ago
THANK YOU!! ✌
@itsjustsam04 3 years ago
Wow I love it! I do have two questions tho. 1 how did you run it in ur Chrome browser. 2 how did u get the cool visual effects for while u were speaking?
@jasminecheung1998 1 year ago
This is a helpful video. I have a question regarding to the audio augmentation. In my project, the test speaker is not in the train data, so my model performers pretty bad on test set,only 50% accuracy. I try to use the pitch shift to agument my train data but doesn't works well. How should I use audio augmentation for this dataset?
@peterhu3362 2 years ago
Nice tutorial!
@peacekeepermoe 3 years ago
Great content dude. I haven't seen anything new for the last 7 months though. Hope you're well :)
@Tera2Space 1 year ago
Hello, when will there be a guide to creating your own speech synthesis? (TTS)
@guidoscalise 2 years ago
What books/material would you recommend to someone wanting to learn to design models like the one you’re detailing around 7:36?
@pranavthakur6744 2 years ago
Can you make a detailed video how did you manage to make it. I want to learn it.
@masudtalukdar5672 3 years ago
Cool project ❤
@maryamnazari1281 10 months ago
great job! i want to train a speaker identification project..any ideas where to start?
@justinfuruness7954 3 years ago
Do you have any recommendations for how to learn AI? How long did this take to train?
@kellbooby265 3 years ago
Can u make a AI voice assistant for Linux or Windows. Which we can train
@dineshlamarumba4557 3 years ago
which ASR framework did you used? What is your thought on fairseq wav2vec for this purpose?
@aakaashshroff1672 3 years ago
Can you please make a video on making your own speech synthesizer
@AlanJames1987 1 year ago
Good video but are you using Linix at 9:31 and Windows at 9:34? I haven't used Windows in a few years so I didn't know you could do this.
@tomhamser7216 3 years ago
Could you show the code in detail or how I can use it with another model? Could I use a deepspeech model for testingit, too?
@bhanuexcalibur 3 years ago
when do we expect the next part? NLU and skills waiting for it
@MrDonald911 3 years ago
Hey ! I just discovered your channel, nice content ! Your model seems overfitting, I think you should evaluate it on a test data (and not the validation). I would be curious to know how it would perform if you do hyperparameter tuning.
@PaulClifford 3 years ago
Parts 3 & 4 haven't materialized in a year. I'd love to see the rest.
@shashwatgandhi7653 2 years ago
yes
@waisyousofi9139 1 year ago
Thanks , Can you make a tutorial on code implementation of speech recognition. that would be great.
@elektroprogramming 2 years ago
what's the different with speech_recognition library that we can use without training the data?
@mohammadrezakhalilishoja2701 3 years ago
how did you up-sampled data to create 50 hrs from 1 hr?
@scarlett_j 1 year ago
Sorry to inform you, but you pretty much rock, at the same time solved this so I don't have to.
@NathanaelNewton 1 year ago
This looks like exactly what I need! Thanks for posting, I'm gunna follow along and watch tonight. One question.. Why are you using the auto generated subs on this video 😂😁
@notoltrexclearly2690 3 years ago
so it is possyble to make the virtual assistant write on another command prompt instead of talking, to use it with a custom text to speech AI? would love to see that
@vincebelansky425 8 months ago
Thank you for this video and the insight of how to design a voice recognition system independently from the ground up by an newly to AI. Most videos tell you to connect the internet and to a big server by google or someone else. The only question that I have is why use python and not C or C++, especially since you are running a raspberry pi with limited memory and slower CPU and the natural time restraints of real-time speech recognition?
@ZpErMy 1 year ago
Hello, I was fascinated with your Speech Recognition System. I wonder, could your system recognize sung musical notes? that is, instead of words, musical notation.
@tomhamser7216 3 years ago
Did you have another tutorial as source or how did you do that?
@saishastech8023 3 years ago
Awesome research and implementation 👌 thanks for sharing ☺️ keep going 👍 new friend here 🙂