Build your own real-time voice command recognition model with TensorFlow

  Рет қаралды 50,223

AssemblyAI

AssemblyAI

Күн бұрын

In this TensorFlow Tutorial we build our own real-time voice command recognition model that can then control a game.
Tutorial + Colab: www.tensorflow.org/tutorials/...
Code: github.com/AssemblyAI-Example...
Get your Free Token for AssemblyAI Speech-To-Text API 👇www.assemblyai.com/?...
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: www.assemblyai.com
🐦 Twitter: / assemblyai
🦾 Discord: / discord
▶️ Subscribe: kzbin.info?...
🔥 We're hiring! Check our open roles: www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Timeline:
00:00 Intro
00:52 Build Model: Google Colab Walkthrough
05:09 Save & Download model
08:15 Add our preprocessing code
12:39 Code the final project with microphone input
18:44 Final project testing!!!
Microphone icon created by Freepik - Flaticon: www.flaticon.com/free-icons/m...
#MachineLearning #DeepLearning

Пікірлер: 33
@cornpop3340
@cornpop3340 Жыл бұрын
This is an incredibly helpful video.
@laviniacaldas2452
@laviniacaldas2452 Жыл бұрын
It works =) Thanks!
@donahue1187
@donahue1187 Жыл бұрын
This is fantastic. I’m a Newbie to Python and neural nets, but your explanations are great and pretty straightforward. Question - what additional steps would I take to run this on my own local device (pi 4)? And what else would I need to do to introduce new commands such as as trigger word and “turn off the lights”? Would I need to create my own audio samples, save them to new folders, and retrain to retrain the model? Thanks for any guidance! (if you couldn’t tell I’m DONE w Google Home latency, recreating my own. Ambitious! Need help!)
@gokhanersoz5239
@gokhanersoz5239 2 жыл бұрын
Thank you very much for the tranings. But I think there should be a more complex and more advanced voice recognition, voice classification and similar training series if you see fit. You know, trainings on sound are limited.
@erickd4816
@erickd4816 Жыл бұрын
Good video, excellent explanation, I have a question, can the same program be trained to recognize only a specific voice? if so, could you explain it to me? I would be very grateful.
@Cyka_Blyatus
@Cyka_Blyatus Жыл бұрын
What did you do so the program does not picks up ambient noise or actually works with the commands given? it seems the model lacks ambient noise data sets and whenever ran it only keeps spamming the first command, but yours works perfectly, how to achieve this?
@oxydol3456
@oxydol3456 4 ай бұрын
This tutorial is great. I find that the key to build accurate model is gathering quality data a lot. And that sounds arduous work. didn't get good result with 200 examples. Edit: I found the model's accuracy is the way poor than I expected. Maybe it's due to the microphone I'm using and it's needed to taken care of before predicting process.
@nguyent3465
@nguyent3465 Жыл бұрын
The code on TensorFlow website was changed :(
@geekyprogrammer4831
@geekyprogrammer4831 2 жыл бұрын
Can you please post building text to speech models from scratch?
@MrIlvis
@MrIlvis 7 ай бұрын
On which Tensorflow version this was made? because Colab uses latest, but older one should work without problems.
@itsrairamones
@itsrairamones Жыл бұрын
thankyou dude its a hundred percent work for me but after couple minutes it crashed :(
@seanadin386
@seanadin386 10 ай бұрын
Can you do a video regarding the newer version? The run interface now has a different code
@tvartalk
@tvartalk 2 ай бұрын
😊
@divyakhetan8754
@divyakhetan8754 6 ай бұрын
Is it a customised model (designed for a single person) or it can work on anyone's command
@TheSaukkio
@TheSaukkio 6 ай бұрын
How can it be that in the video it gives nothing with out speaking. While if i run the code from github it predicts random stuff when im not speaking.
@swasthikk3655
@swasthikk3655 5 ай бұрын
Can i get similar for English alphabets
@clumsycoder1907
@clumsycoder1907 Жыл бұрын
its not working for me
@loydvincentbutron4345
@loydvincentbutron4345 4 ай бұрын
is it for english voice only?
@danielbogemann1598
@danielbogemann1598 10 ай бұрын
They changed the Code. Could u you do a quick update?
@sanjeetjha9177
@sanjeetjha9177 6 ай бұрын
Please provide me the model i need argently I am stuck in it
@tankado_ndakota
@tankado_ndakota Ай бұрын
Got the error: "Could not import the PyAudio C module 'pyaudio._portaudio'." And couldn't find the solution... Macbook M1 Pro
@tankado_ndakota
@tankado_ndakota Ай бұрын
I saw a note in other video for M1 :) let me try first :D
@tankado_ndakota
@tankado_ndakota Ай бұрын
i did everything that I found from web. but still i got the error: "symbol not found in flat namespace '_PaMacCore_SetupChannelMap'"
@rediet.f261
@rediet.f261 Жыл бұрын
what is sample_file in here 8:38
@clumsycoder1907
@clumsycoder1907 Жыл бұрын
same doubt
@arqamrafay
@arqamrafay 10 ай бұрын
exactly, i think their is file of recorded audio
@LukasKofler
@LukasKofler 8 ай бұрын
See the first line at 5:38 🙂
@Yvtq8K3n
@Yvtq8K3n Жыл бұрын
Its a shame, you cant train your own model.
@threepe0
@threepe0 Жыл бұрын
of course you can
@Yvtq8K3n
@Yvtq8K3n Жыл бұрын
@@threepe0 The last time I used this, you were unable to create a custom model and use it. Tensorflow provided you with an already trained model (0-1, left, right) and thats exactly what most people use.
I Built a Personal Speech Recognition System for my AI Assistant
16:32
Incredible magic 🤯✨
00:53
America's Got Talent
Рет қаралды 68 МЛН
THE POLICE TAKES ME! feat @PANDAGIRLOFFICIAL #shorts
00:31
PANDA BOI
Рет қаралды 24 МЛН
3M❤️ #thankyou #shorts
00:16
ウエスP -Mr Uekusa- Wes-P
Рет қаралды 14 МЛН
Build a Deep Audio Classifier with Python and Tensorflow
1:17:11
Nicholas Renotte
Рет қаралды 162 М.
Let's build GPT: from scratch, in code, spelled out.
1:56:20
Andrej Karpathy
Рет қаралды 4,5 МЛН
100+ Linux Things you Need to Know
12:23
Fireship
Рет қаралды 740 М.
Build a Speech Recognition System on a Raspberry Pi
6:09
AssemblyAI
Рет қаралды 56 М.
Incredible magic 🤯✨
00:53
America's Got Talent
Рет қаралды 68 МЛН