Creating a Speech to Text Program with Python

  Рет қаралды 83,269

CS Coach

CS Coach

Күн бұрын

Пікірлер: 100
@danoconnell5292
@danoconnell5292 9 ай бұрын
I've spent 5 days trying to figure this out knowing nothing literally about nothing and by this I mean virtual assistants you can talk to. With the learning curve I will say I wish I watched your video first. Your the best to learn from so far my man nice job.
@CSCoach
@CSCoach 9 ай бұрын
That means a lot man. Thanks so much for the feedback.
@Knot2goodAtIt
@Knot2goodAtIt 6 ай бұрын
I had nooooo idea this was so straightforward! Thank you! I want to create a translator and I think this is the perfect base!
@File_corupt
@File_corupt 11 ай бұрын
yo this tutorial is great as I have tried to watch other tutorials as a noob but they talk like I know the stuff but I dont but you explain the stuff to me great
@goner007
@goner007 10 ай бұрын
this worked so well. i never knew it was that easy until you explained it. Hope to learn more from you thanks
@ymhtpat
@ymhtpat 10 ай бұрын
nvm figured it out. for those of you asking about the last step for windows os, go to note pad and type in " C:\Users\[your computer]> " then the first prompt then do the same for the second. save the file as a .txt. it should work from there.
@BobJoe-lt1is
@BobJoe-lt1is 3 ай бұрын
What second prompt are you talking about? I wrote that in the notepad and yet my program sometimes outright refuses to work.
@DevonAIPublicSecurity
@DevonAIPublicSecurity Жыл бұрын
Hey Oscar this was a great tutorial, keep doing the work you are doing and this was very clear and it made perfect sense.
@CSCoach
@CSCoach 9 ай бұрын
Thanks for the feedback. It means a lot. Going to ramp up the channel with more content in the coming months :)
@artetridimensionale
@artetridimensionale Жыл бұрын
ok so it's a bit simple, the problem is if you stop talking and you have to make sure the text is justified and doesn't end up at the end of a line every time you interrupt yourself
@ColinTimmins
@ColinTimmins Жыл бұрын
Cool stuff! I have struggled so much with language as I’m extremely dyslexic. ChatGPT has opened up the door 🚪 for me! 😊
@CSCoach
@CSCoach Жыл бұрын
That's awesome :D So glad ChatGPT is able to help you :)
@ymhtpat
@ymhtpat 10 ай бұрын
On the last step I'm hung up. I'm on the latest windows os, Are you running the touch output.txt and tail output on macs equivalent of command prompt? I ran it on mine and came up with " 'touch' is not recognized as an internal or external command, operable program or batch file." where did I go wrong?
@joyaljijimon3419
@joyaljijimon3419 7 ай бұрын
same here bruh
@poisoned_durian8
@poisoned_durian8 9 ай бұрын
im using windows im not so sure if the problem is the commands "echo. > output.txt and type output.txt | more"
@pattuchiitu8978
@pattuchiitu8978 11 ай бұрын
Thank you for the tutorial. its works now 👋
@MrScgaming28
@MrScgaming28 Жыл бұрын
Can you tell me how to do the last step of making that output file and tailing the outputs in a windows OS?
@anderson3889
@anderson3889 10 ай бұрын
did you find a way
@arvindh13
@arvindh13 9 ай бұрын
In windows it will record your audio & print it to the notepad, but you need to restart your notepad to see it.
@sandrasajeev8640
@sandrasajeev8640 5 ай бұрын
@@arvindh13 Hey! I'm also stuck at this step. Can you pls explain like where do I've to put the commands to touch and tail the output?
@mmjuuno
@mmjuuno Жыл бұрын
how do i see the appended msgs in the terminal? it worked and I could open the output txt file and see what i was saying, but I'm not sure how to see it realtime ig (using Windows 10 and the windows subsystem for linux to install kali linux, and then opened bash in cmd). also when i stop the script (using pycharm) it spits back a bunch of errors and I'm not sure why, or at least i think they're errors
@CSCoach
@CSCoach Жыл бұрын
I did this by running the tail command in another terminal. Though, you could also change line 43 to be print(text) rather than print("Wrote text")
@kavito5947
@kavito5947 Жыл бұрын
@@CSCoachwhats the name of the terminal you used?
@mohitpandya_2228
@mohitpandya_2228 10 ай бұрын
This is just working fine for the first run. After the first run the text generated takes about 10 mins to get to the output. How can i fix that thing and make it just as fast as the first run
@Illogical.
@Illogical. Жыл бұрын
I need a thing that does a few steps less than what this does. I only want it to record phonetics, maybe spacing between words, maybe intonation, and maybe recognize when I'm pausing to remember a word.
@loisisnel2955
@loisisnel2955 2 ай бұрын
Thanks for the course, but does anyone know how can i do it by using an audio file and not a voice recognition ?
@anirudhsrisai3397
@anirudhsrisai3397 Жыл бұрын
Think of integrating it with chatgpt and getting answers for it just by using the text which we have received from speech to text
@CSCoach
@CSCoach Жыл бұрын
Haha, did you see this video I made :) I believe I do what you suggested in the comment. kzbin.info/www/bejne/eHbaZnh8pqh2e6s
@tugpsx640
@tugpsx640 Жыл бұрын
This is great, thanks for sharing you tips and tricks.
@CSCoach
@CSCoach Жыл бұрын
You bet!
@KNOCKOUT-t7e
@KNOCKOUT-t7e 8 ай бұрын
@@CSCoach what is the name of the app u used in the video ??
@Yajnco
@Yajnco 8 ай бұрын
My language Hmong is not available for voice to text and I have been searching for programs or ways to show me how it's done, but can't find any. Not sure if Python is the solution. Frustrating of not knowing where to start.
@alexanderkartvelian4274
@alexanderkartvelian4274 Жыл бұрын
You are awesome! It would be great if you teach us how to send generated text from recognizer into "Text to speech" . thanks for the video!
@SirMrMystery
@SirMrMystery 10 ай бұрын
he already did something like that in which he created a jarvis like program kzbin.info/www/bejne/eHbaZnh8pqh2e6ssi=wEeSWa18kFPppBUF
@aotrakstar
@aotrakstar Жыл бұрын
Hi coach. do you think maybe using openai's whisper will have a more accurate outcome for transcribing speech?
@CSCoach
@CSCoach Жыл бұрын
I found the python library to be enough personally. Though, I would imagine whisper to be better given that it's made by openAI. I'd expect it to be a better trained AI
@hjoseph777
@hjoseph777 3 ай бұрын
You can use whisper offline. I suggest faster-whisper or whisper.cpp
@abaizkhan4963
@abaizkhan4963 3 ай бұрын
Any git repo link?
@snaxsammy6472
@snaxsammy6472 Жыл бұрын
Hi Coach! MrScgaming28 and I, from the comment section seem to have the same issue, could you re-explain how to do the last step of making that output file and tailing the outputs in a windows OS?
@arvindh13
@arvindh13 9 ай бұрын
Create a text document & copy the source --> paste it in the program.
@story-4-you-man
@story-4-you-man 7 ай бұрын
@@arvindh13 can you explain this? program?
@JamesDonkor-edu
@JamesDonkor-edu 11 ай бұрын
hey great video! would this work for other languages?
@danielcasas7343
@danielcasas7343 8 ай бұрын
I also hace the same question
@workstation-s2n
@workstation-s2n 4 ай бұрын
Could you please post a link to the full file
@jimjones26
@jimjones26 Жыл бұрын
I am going to work on adding a "trigger" word that will let the program know I want to act.
@CSCoach
@CSCoach Жыл бұрын
That's cool!! If you get that to work, I'd love to know how
@mosawarjamshady2989
@mosawarjamshady2989 Ай бұрын
Hey did you end up figuring this out?
@fransuacordero5407
@fransuacordero5407 5 ай бұрын
one question, this works only for english lenguage? or can be used with other languages?
@Quagik
@Quagik 3 ай бұрын
its saying AttributeError: module 'speech_recognition' has no attribute 'Recognizer'
@ernstb1234
@ernstb1234 2 ай бұрын
It keep telling me Zach command not found when I try installing the dependency please help
@mohamed_Sultan1504
@mohamed_Sultan1504 8 ай бұрын
Does anyone know how to out it in a user-friendly app so anyone can use it easily?
@Ddetektiv
@Ddetektiv Жыл бұрын
how do you get terminal? When I clicked on terminal, new shows me and there was PS C:\Users\[my name]>, but as I can see, in ur it's not. Any way how to fix it?
@snaxsammy6472
@snaxsammy6472 Жыл бұрын
At the top bar of the terminal where it says "problems" click on the drop down arrow next to the "+" in the further right of that bar, then click on "command prompt", you should be able to run the code through there. At least thats what I did and it worked for me.
@Kenoki-yi3gf
@Kenoki-yi3gf 7 ай бұрын
i am doing this on windows os and i cant seem to find where the text is saved.... can someone please help
@Hbdisaster_28
@Hbdisaster_28 6 ай бұрын
For me whenever i say something, the terminal shows wrote text but idk where it writes the text(im on windows btw) can anyone plz help
@arpitv2003
@arpitv2003 6 ай бұрын
it would be creating a text file with name "output.txt" and writing into it
@Hbdisaster_28
@Hbdisaster_28 6 ай бұрын
Thank you
@sicfrydred
@sicfrydred Жыл бұрын
what program did you use to do this?
@CSCoach
@CSCoach Жыл бұрын
Did it in python :)
@Bartdu59Gaming
@Bartdu59Gaming Жыл бұрын
He used "VS Code" and the language used for the program is Python.
@kavito5947
@kavito5947 Жыл бұрын
@@Bartdu59Gamingwhats the name of the terminal he used?
@usus8420
@usus8420 8 ай бұрын
hi it's only desktop solution ?
@hjoseph777
@hjoseph777 3 ай бұрын
You did Not show much also where is the Git repository link. Smartie
@Lunaraa0
@Lunaraa0 Жыл бұрын
Hi oscar does it works for french
@OmarAbdelrazek-tn9du
@OmarAbdelrazek-tn9du Жыл бұрын
I still don’t understand how to download the libraries, I have a Mac m2 and documentation u gave didn’t help me. So can u or anyone else help me?
@felixforster5836
@felixforster5836 Жыл бұрын
You have to type in the terminal "pip3 install ...."
@adrio1569
@adrio1569 Жыл бұрын
How can we add languages to it?
@arvindh13
@arvindh13 9 ай бұрын
You need pre designed language libraries or you need to make a new one with a lot of data.
@mohamed_Sultan1504
@mohamed_Sultan1504 8 ай бұрын
A real king
@user-db4nm2rp2w
@user-db4nm2rp2w 10 ай бұрын
if i want to do only speech to text, then I don't need to install pyaudio, right??
@arvindh13
@arvindh13 9 ай бұрын
Yes, you need Pyaudio & Speech recognition fork library
@i.a_n_i_l_k_u_m_a_r
@i.a_n_i_l_k_u_m_a_r 10 ай бұрын
How to stop recording the voice?
@suissegarantiegaming2100
@suissegarantiegaming2100 Жыл бұрын
Could this also work with a other Language?
@CSCoach
@CSCoach 10 ай бұрын
Programming language? Or the text is outputted to a different language? Either way I'm pretty sure the answer would be yes :)
@KNOCKOUT-t7e
@KNOCKOUT-t7e 8 ай бұрын
what app he used in this video??
@stylloofdreams975
@stylloofdreams975 8 ай бұрын
Visual Studio Code
@KutuluTuk
@KutuluTuk Жыл бұрын
i couldnt import pyttsx3 it says unused and alot of error there 😂
@arvindh13
@arvindh13 9 ай бұрын
you need to install it, it's not an inbuilt library.
@falcongold2024
@falcongold2024 9 ай бұрын
why import pyttsx3? it is never used?
@arvindh13
@arvindh13 9 ай бұрын
It's not required yes.
@manchishirisha8013
@manchishirisha8013 5 ай бұрын
where are u writing the code
@ruzix_yeager4938
@ruzix_yeager4938 23 күн бұрын
vscode
@WolFX_FPS
@WolFX_FPS Жыл бұрын
I have an error with the speech_recognition library: r.adjust_for_ambient_noise(source2, duration=0.2) TypeError: Recognizer.adjust_for_ambient_noise() missing 1 required positional argument: 'source' Any ideas?
@arvindh13
@arvindh13 9 ай бұрын
You need to provie the source before that, please check the program again and if it does not work, install Speech recognition fork library.
@gleful262
@gleful262 7 ай бұрын
got this error, fixed it by adding() to Recognizer
@thunderchills640
@thunderchills640 4 ай бұрын
thanks
@dungnguyen-si2sz
@dungnguyen-si2sz 8 ай бұрын
i am getting trouble with this error: 1 error generated. error: command '/usr/bin/clang' failed with exit code 1 [end of output] note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for pyaudio Failed to build pyaudio ERROR: Could not build wheels for pyaudio, which is required to install pyproject.toml-based projects I cant install the two last kind of stuffs you gave us, hope you reply soon, thank you for that nice work sir.
@mikethompson6455
@mikethompson6455 8 ай бұрын
Getting same error. Hope someone can help
@mikethompson6455
@mikethompson6455 8 ай бұрын
Did this. I have Mac OS so If you're using macOS, you can install the necessary tools using Homebrew. First, make sure you have Homebrew installed, then run: brew install portaudio After installing portaudio, you can try installing PyAudio again using pip3 install pyaudio.
@gleful262
@gleful262 7 ай бұрын
had this issue on widows fixed it by running as admin
@odogbolahan8148
@odogbolahan8148 7 ай бұрын
my name is oscar too haha
@hebataha5197
@hebataha5197 7 ай бұрын
Does this work offline ?
@brianmckeown70
@brianmckeown70 4 ай бұрын
^
@hjoseph777
@hjoseph777 3 ай бұрын
Your screen not very clear
@soham0726
@soham0726 8 ай бұрын
source code
@phpsolutioncode9309
@phpsolutioncode9309 Ай бұрын
I continue your work ! thank you! for your video was essential for my Idea. kzbin.info/www/bejne/jWetlHxvhtxrl7c
I Built a Personal Speech Recognition System for my AI Assistant
16:32
Best FREE Speech to Text AI - Whisper AI
8:22
Kevin Stratvert
Рет қаралды 1 МЛН
The evil clown plays a prank on the angel
00:39
超人夫妇
Рет қаралды 53 МЛН
How Strong Is Tape?
00:24
Stokes Twins
Рет қаралды 96 МЛН
How I Would Learn Python FAST (if I could start over)
12:19
Thu Vu data analytics
Рет қаралды 677 М.
5 Python Libraries You Should Know in 2025!
22:30
Keith Galli
Рет қаралды 78 М.
Create Stunning Python GUIs in 10 Minutes With Drag & Drop
11:38
Coding Is Fun
Рет қаралды 108 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,8 МЛН
Speech To Text using ESP32
13:57
techiesms
Рет қаралды 67 М.
5 Good Python Habits
17:35
Indently
Рет қаралды 683 М.
3 PYTHON AUTOMATION PROJECTS FOR BEGINNERS
17:00
Internet Made Coder
Рет қаралды 1,7 МЛН
Modern Graphical User Interfaces in Python
11:12
NeuralNine
Рет қаралды 1,6 МЛН
Create a LOCAL Python AI Chatbot In Minutes Using Ollama
13:17
Tech With Tim
Рет қаралды 135 М.