Real-time Speech Recognition in 15 minutes with AssemblyAI

229,241 views

AssemblyAI

1 day ago

Get your free speech-to-text API token 👇
www.assemblyai...
Transcribing in real time is a super skill only court reporters can brag about. Luckily, we don't need to learn to type fast to get quick transcriptions of audio. Thanks to AssemblyAI's Streaming Speech-to-Text model (previously real-time speech recognition), it is very simple to set up a Python script that listens for audio and turns it into text.
In this video, we will see how to create this script in Python with the help of pyaudio, WebSockets and asynchronous functions. The app will listen to audio input through a microphone and display the transcription in real time. We will integrate this code into a simple Streamlit application to showcase real-time speech recognition with a touch of interactivity.
If you’d like to follow along, don’t forget to get your own AssemblyAI API token for free at assemblyai.com
You can find the code from this tutorial in this GitHub repository: github.com/mis...
Find the written form of this tutorial here: www.assemblyai...
AssemblyAI Streaming STT docs: www.assemblyai...
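The send/receive pattern described above can be sketched roughly as follows. This is a minimal illustration and not the code from the video: the `audio_data` field and the `PartialTranscript` / `FinalTranscript` message types are assumptions about the legacy real-time message format, and all function names are hypothetical. Two `asyncio` queues and a stand-in `echo_server` replace the real WebSocket connection and microphone so the sketch runs offline.

```python
import asyncio
import base64
import json


def encode_audio_chunk(chunk: bytes) -> str:
    """Wrap raw audio bytes in a JSON message (field name is an assumption)."""
    return json.dumps({"audio_data": base64.b64encode(chunk).decode("utf-8")})


def extract_final_text(result_str: str):
    """Return the text of a final transcript, or None for any other message."""
    message = json.loads(result_str)
    # A misspelled type string here would silently match nothing.
    if message.get("message_type") == "FinalTranscript":
        return message.get("text", "")
    return None


async def send_loop(chunks, outbox: asyncio.Queue) -> None:
    """Push encoded audio chunks; None marks the end of the stream."""
    for chunk in chunks:
        await outbox.put(encode_audio_chunk(chunk))
        await asyncio.sleep(0)  # yield so the receiver can run concurrently
    await outbox.put(None)


async def receive_loop(inbox: asyncio.Queue, transcripts: list) -> None:
    """Collect only final transcripts from incoming messages."""
    while (result_str := await inbox.get()) is not None:
        text = extract_final_text(result_str)
        if text is not None:
            transcripts.append(text)


async def echo_server(outbox: asyncio.Queue, inbox: asyncio.Queue) -> None:
    """Hypothetical stand-in for the remote service, for offline testing only."""
    count = 0
    while await outbox.get() is not None:
        count += 1
        partial = {"message_type": "PartialTranscript", "text": f"chunk {count}"}
        await inbox.put(json.dumps(partial))
    final = {"message_type": "FinalTranscript", "text": f"received {count} chunks"}
    await inbox.put(json.dumps(final))
    await inbox.put(None)


async def run_session(chunks, server) -> list:
    """Run sender, stand-in server and receiver concurrently."""
    outbox, inbox = asyncio.Queue(), asyncio.Queue()
    transcripts = []
    await asyncio.gather(
        send_loop(chunks, outbox),
        server(outbox, inbox),
        receive_loop(inbox, transcripts),
    )
    return transcripts


if __name__ == "__main__":
    print(asyncio.run(run_session([b"\x00\x01", b"\x02\x03"], echo_server)))
```

In a real script, the queues would presumably be replaced by a WebSocket connection and the chunk list by frames read from a pyaudio stream. The point of the sketch is only that sending and receiving run as separate coroutines under `asyncio.gather`.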

Comments: 74
@saifullahkhan9837 2 years ago
The accuracy and formatting are quite interesting here.
@AssemblyAI 2 years ago
Thank you! - Mısra
@debojitmandal8670 9 months ago
@AssemblyAI Hi, what if I want the input to come not from the microphone but from my laptop speaker? How do I do that?
@pjayo 2 years ago
Is there a JavaScript version of this video, please? Both server side and front end…
@lfmtube 2 years ago
Most instructional and useful video. Thank you.
@AssemblyAI 2 years ago
You're very welcome!
@otomakannioc8213 1 year ago
Very likeable and engaging presentation. Maybe the most beautiful side of Artificial Intelligence 😊
@AssemblyAI 1 year ago
Thank you!
5 months ago
Thanks for everything :)
@ashiqashervegar7973 1 year ago
How can I use this to transcribe particular Chrome tabs for online meetings? Can you help me with that?
@slimyelow 1 year ago
Very kewl, it works. However, the live service requires an $8 minimum, but it's totally worth it.
@Asparuh.Emilov 2 years ago
This is really awesome! I would prefer, though, to see the final result as short highlights at the beginning of your videos before you go into the details of the how-to. But thanks anyway for the effort and the time! Hugs!
@AssemblyAI 2 years ago
Thanks for the feedback! It's definitely a good idea to give an impression of the app that is being built. With the newer videos we do a preview at the beginning of the videos indeed. - Mısra
@Asparuh.Emilov 2 years ago
@AssemblyAI 🤗🤗♥️♥️
@mehdismaeili3743 1 month ago
Excellent.
@claudiotassis 1 year ago
Incredible video. Would I be able to use ChatGPT as an intermediate step to correct the sentences for vocabulary and grammar, and after that get the response based on those ChatGPT-"reviewed" sentences?
@mohamedshagie3342 1 year ago
Yup, I tried to build it, but it only worked with text; I couldn't use speech 😅
@MrThought2012 1 year ago
Very nice and easy setup! Took me ages to achieve the same with Whisper. However, are you planning to support other languages, such as German or French, or even a multilingual model?
@omarsiddiqi5018 10 months ago
Can I ask how you were able to do it?
@alexander5429 1 month ago
@assemblyai: When will you finally support streaming in German?
@fahnub 1 year ago
Does it also offer diarization in real time?
@ckames22 2 years ago
Awesome 👍
@AssemblyAI 2 years ago
Thank you!
@1992kshitizyadav 4 months ago
As of now, only English is supported in the live transcription feature. When can we expect support for more languages?
@lookersky6145 1 year ago
I've installed this and it works on Windows. My question: does Real-time Speech Recognition only recognize English? Does it support other languages? Thank you.
@Pinkijhabnp 10 months ago
Thank you for this nice tutorial
@AssemblyAI 10 months ago
Glad you liked it
@usus8420 5 months ago
Hi, great work, but what about smartphones?
@PoojaVerma-sl6mg 1 year ago
Could you please instruct me on how I can include this in my Angular project?
@KashyapJadav 1 year ago
Is live transcription a paid feature?
@adhikesavan9377 2 years ago
When I tried to install pyaudio, the terminal displayed this error: "Cannot open include file: 'Python.h': No such file or directory"
@Miguel-hq1lx 5 months ago
Is it possible to transcribe in real time in other languages, such as Spanish?
@weebiesoftware6296 4 months ago
I want to implement a real-time app using voice recognition with Python 3 on Android 11 on my Samsung S22. It's my understanding that PortAudio is NOT supported on Android 11. Is PortAudio your only way to get to the mic?
@moncefarajdal4582 2 years ago
Can you please let me know how I can integrate this into my Java Maven project?
@AssemblyAI 2 years ago
Hey Moncef, unfortunately I don't have experience with that either. - Mısra
@GiulianoGolfieri 1 year ago
Is it possible to use this service in other languages apart from English?
@tiagofyhnesteves74 1 year ago
I'm also trying to find an answer to this question.
@GiulianoGolfieri 1 year ago
@tiagofyhnesteves74 They answered me privately. It's not possible yet. I switched to Azure Cognitive Services, which is multi-language.
@frizzfrizz3550 10 months ago
@GiulianoGolfieri I had taken it for granted that it was a multilingual service, a fucking morning's work wasted. Thanks for the info, Giuliano
@amineelarif7001 2 years ago
That is sick! Good job
@AssemblyAI 2 years ago
Thank you Amine! - Mısra
@spinal_cord 1 year ago
I know this is a little old, but I get a 4002 error, what might cause that?
@walker71391 1 year ago
Did you ask ChatGPT?
@HomelessRafi 1 year ago
How can I include ums, ahs, and other filler words in the real-time transcription? I see it is an option when uploading an audio file.
@borr2749 1 year ago
Doesn't AssemblyAI real-time transcription have a free trial?
@onintsoavola5698 8 months ago
Is it possible to make it faster? The transcription takes a little time.
@loubino18 6 months ago
Should have mentioned the cost of going to the pro version... why hide it?
@GaneshRedagani 1 year ago
Can you please let me know how to save that text?
@eagold 2 years ago
But... what if I have no money to buy the pro key? 😕
@AssemblyAI 2 years ago
You can get started for free!
@MDMUHTADEEFAIAZKHANSOUMIK 1 year ago
Can we set up the Bangla language for this system?
@parameswaranesnsce-cse9491 8 months ago
Can we speak any Indic languages? Will this endpoint transcribe them or not?
@AssemblyAI 8 months ago
Yes, AssemblyAI's API supports Hindi transcription. Check out this tutorial: kzbin.info/www/bejne/aYjPf4J5mt6soLM
@REALVIBESTV 1 year ago
Can this work in Unreal Engine 5?
@angelfernando8954 2 years ago
Hi. How can I change the language to transcribe in Spanish?
@AssemblyAI 2 years ago
Hey Angel, here is the documentation on transcribing in languages other than English. docs.assemblyai.com/walkthroughs#specifying-a-language
@dirtydevil81 2 years ago
@AssemblyAI But do different languages work with real-time transcription on this specific endpoint? The documentation on changing the language is not clear about this.
@giovanniied 1 year ago
@dirtydevil81 Did you find a solution?
@rubibeats 1 year ago
How do I add a custom UI?
@bakhshizade 11 months ago
I am here for Freddie.
@IntricateMoon 2 years ago
I'm on Windows. When I try to run it, it does nothing; it just creates a new line in the terminal. When I cloned the GitHub repo, it was working, hmmm
@AssemblyAI 1 year ago
Have you tried speaking while the code is running? It might be that you don't have a microphone connected to the computer.
@siamkamelia87 2 years ago
Does this work for song transcription? In real time?
@AssemblyAI 2 years ago
Hey Siam, depending on the amount of background music and clarity of pronunciation you'd get varying levels of success with transcribing songs.
@marlontuquerres6072 2 years ago
THIS IS ONLY AVAILABLE ON MAC/LINUX, RIGHT?
@AssemblyAI 2 years ago
No, it is available independent of the operating system.
@ibrahimimohssine8131 2 years ago
Does AssemblyAI support the Arabic language with vowelization?
@AssemblyAI 2 years ago
We are launching support for Arabic in late January!
@benyusu8045 10 months ago
received 4001 (private use) Not authorized; then sent 4001 (private use) Not authorized
@Homurdan 2 years ago
Aha, a Turk!
@egeyay9470 2 years ago
Ahahahha
@barankaya3333 1 year ago
Are you Turkish?
@valerozanoni952 1 year ago
When I added this line, if json.loads(result_str)['message_type'] == 'FinalTranscirpt':, it wouldn't transcribe anything anymore.