Google Cloud Speech-To-Text API With Python For Beginners

  Рет қаралды 32,999

Jie Jenn

Jie Jenn

Күн бұрын

Пікірлер: 31
@frankking5326
@frankking5326 8 ай бұрын
After two days fighting with this on my own, your video solved my problem!! Thanks
@mariodevelopersantos1102
@mariodevelopersantos1102 10 ай бұрын
this is exactly what I needed, thanks
@ashishkumar-eo1tz
@ashishkumar-eo1tz Жыл бұрын
Simply awesome. Keep up the good work bro 👍
@TheCloudShepherd
@TheCloudShepherd 6 ай бұрын
Excellent. Thank you very much
@DungLe-rp5vu
@DungLe-rp5vu 8 ай бұрын
Your instructions are great, besides how can I get text from online mp3 link?
@user-wr4yl7tx3w
@user-wr4yl7tx3w Жыл бұрын
by chance, do you have any idea what Industry Coding Assessment is, as part of an interview? I was told that it is not the same as testing you on algorithms, like in LeetCode.
@jiejenn
@jiejenn Жыл бұрын
Each company is different, so there's no defined answer to be honest.
@TheCloudShepherd
@TheCloudShepherd 6 ай бұрын
Google documentation sucks. Thanks for this clearly explained how-to video
@jiejenn
@jiejenn 6 ай бұрын
Glad the video helped.
@Grams79
@Grams79 Ай бұрын
Thank you!
@g_30_sanketpatil62
@g_30_sanketpatil62 5 ай бұрын
hey bro i have created a voice bot using google dialogflow cx, now I wanted to transcribe the ongoing voice call so can you please tell me how can I achieve it thanks
@RicardoCrumbleton
@RicardoCrumbleton 8 ай бұрын
How could this be adapted for the v2 api with Chirp?
@jessenorris1672
@jessenorris1672 29 күн бұрын
Hi there could you make a video on adding the speech to text API to a discord channel, that would help me out a lot thanks in advance.
@JuanFernandoCuetoHuaringa-i7e
@JuanFernandoCuetoHuaringa-i7e 7 ай бұрын
Donde se encuentra la interfaz de usuario?
@ashishprakash8430
@ashishprakash8430 9 ай бұрын
hii, i follow your tutorial but it is not transcribing all audio.. please help.
@frankking5326
@frankking5326 8 ай бұрын
what error did you get?
@ashishprakash8430
@ashishprakash8430 8 ай бұрын
@@frankking5326 found the solution, I was using default model. Video model worked. Thanks for your tutorial.
@jubileudasilva9258
@jubileudasilva9258 3 ай бұрын
Speech-to-Text has three main methods to perform speech recognition. These are listed below: Synchronous Recognition (REST and gRPC) sends audio data to the Speech-to-Text API, performs recognition on that data, and returns results after all audio has been processed. Synchronous recognition requests are limited to audio data of 1 minute or less in duration. Asynchronous Recognition (REST and gRPC) sends audio data to the Speech-to-Text API and initiates a Long Running Operation. Using this operation, you can periodically poll for recognition results. Use asynchronous requests for audio data of any duration up to 480 minutes. Streaming Recognition (gRPC only) performs recognition on audio data provided within a gRPC bi-directional stream. Streaming requests are designed for real-time recognition purposes, such as capturing live audio from a microphone. Streaming recognition provides interim results while audio is being captured, allowing result to appear, for example, while a user is still speaking.
@MinaNassef-p6r
@MinaNassef-p6r 6 ай бұрын
Is it possible to make it recognize in real-time from a microphone with good performance? Edit: Another Question: Does it support the Arabic language as AWS doesn't in streaming (real-time)?
@jiejenn
@jiejenn 6 ай бұрын
Yeah, it definitely possible, but not going to be cheap though.
@madhav1527
@madhav1527 4 ай бұрын
Hi Did you find out a solution on how to get the input from a microphone, and supporting arabic? in real time, if so do let me know as i am having trouble in implementing the same
@MinaNassef-p6r
@MinaNassef-p6r 4 ай бұрын
@@madhav1527 I used Open AI Whisper
@madhav1527
@madhav1527 4 ай бұрын
​@@MinaNassef-p6r but that has a cost right per api call, could you let me know if you found any library that does it without a cost
@madhav1527
@madhav1527 4 ай бұрын
​​@@MinaNassef-p6r and one more question, how accurate would you say the open ai whisper is
@dhoreys
@dhoreys 4 ай бұрын
My .wav file did not convert. Is there a sample .wav file I could use?
@jiejenn
@jiejenn 4 ай бұрын
You can search on Google, there are plenty.
@dhoreys
@dhoreys 4 ай бұрын
@@jiejenn The samples I have aren't working. I tried those. Gemini is saying to make sure that the file is in LINEAR16 format.
@aastharathod8786
@aastharathod8786 Жыл бұрын
how can i get it for JAVA?
@sadamhussain816
@sadamhussain816 Жыл бұрын
Is it free?
@manualdevalor
@manualdevalor 9 ай бұрын
No
@7BlackJack8
@7BlackJack8 2 ай бұрын
Google couldn't do any better to gets developer away from this. It's an atrocious mess to use the apis
Best FREE Speech to Text AI - Whisper AI
8:22
Kevin Stratvert
Рет қаралды 975 М.
Как мы играем в игры 😂
00:20
МЯТНАЯ ФАНТА
Рет қаралды 973 М.
小丑妹妹插队被妈妈教训!#小丑#路飞#家庭#搞笑
00:12
家庭搞笑日记
Рет қаралды 35 МЛН
Python Speech to Text with Google Cloud Speech
13:31
Parwiz Forogh
Рет қаралды 39 М.
Using Build Time Variables to Create Customer Reports
22:31
Real-time Speech Recognition in 15 minutes with AssemblyAI
19:22
AssemblyAI
Рет қаралды 228 М.
No, Einstein Didn’t Solve the Biggest Problem in Physics
8:04
Sabine Hossenfelder
Рет қаралды 239 М.