OpenAI Whisper model: ASR for many languages AND other languages to English translation model

  Рет қаралды 2,922

Prabhjot Gosal

Prabhjot Gosal

Күн бұрын

Пікірлер: 10
@dr-bijay-kumar-singh
@dr-bijay-kumar-singh 6 ай бұрын
I got no error message, but at "print(len(transcripts_all_NoFall))" the output is zero. Although I have kept 10 English audio samples in the directory. Because of this when I proceed further the output for WER is empty results without any error message. Can you share how you kept the audio samples in your directory. I am using your code on colab.
@prabhjotgosal2489
@prabhjotgosal2489 6 ай бұрын
Hi - The audio samples are simply kept as .wav files in the directory that I am reading from. This directory exists in my Google drive. There are few ways to help root cause your issue further. 1. Before you do any ASR with Whisper.. check if audios are properly read. You could use librosa.load( ) to do so.. Check the output of the librosa.load() to see if the values in the audio files are non-zero. 2. Make sure the audio files contain speech. If there is no speech detected by the Whisper model in the audio, it will output nothing for the transcripts.
@ABAnuSaraReality
@ABAnuSaraReality 7 ай бұрын
Hi, i have doubt . I uploaded the files in google collab via google drive , but the upload files and folders are not displaying even i auth the google drive with google collab , it doesn't show. Is it ok to use vscode as i am familar with the vscode functionality? Or is it compulsory to submit via google collab or can i upload the vs code executed files in google collab ?
@prabhjotgosal2489
@prabhjotgosal2489 7 ай бұрын
Hi - What error do you get when you try to run the code in colab? I am not sure I understand what you mean by, "the vs code executed files in google collab", Are you refereeing to audio files or something else?
@ABAnuSaraReality
@ABAnuSaraReality 7 ай бұрын
@@prabhjotgosal2489 I am referring about mounting the whole project in google collab. but i can't properly auth the google drive in google collab and mount them.
@prabhjotgosal2489
@prabhjotgosal2489 7 ай бұрын
@@ABAnuSaraReality You can run the entire code in VScode locally on your machine. You may have to adjust some syntax and ofcourse the filepath. I suggest creating a new file in VScode and copy/paste sections of code from my file little by little, rather than running the original file all at once. It will make debugging and understanding the code easier.
@lavkushdas5529
@lavkushdas5529 Жыл бұрын
hey getting error as " FileNotFoundError: [Errno 2] No such file or directory: '/content/drive/My Drive/Whisper_Test/NoFall' " how to resolve?
@prabhjotgosal2489
@prabhjotgosal2489 Жыл бұрын
Hi.. you will need to adjust the file path.. Change it to wherever the audio file you are trying to process is located on your machine or Google drive if you are using colab.
@CricketExpress-hx4mh
@CricketExpress-hx4mh Жыл бұрын
@@prabhjotgosal2489 hey can you provide the test audio for this?
@jatinjoshi7549
@jatinjoshi7549 4 ай бұрын
can i get the dataset used by you
Conformer: Convolution-augmented Transformer for Speech Recognition #nlp
12:47
OpenAI Whisper: Robust Speech Recognition via Large-Scale Weak Supervision | Paper and Code
1:02:42
ТВОИ РОДИТЕЛИ И ЧЕЛОВЕК ПАУК 😂#shorts
00:59
BATEK_OFFICIAL
Рет қаралды 6 МЛН
Муж внезапно вернулся домой @Oscar_elteacher
00:43
История одного вокалиста
Рет қаралды 7 МЛН
How to Install & Use Whisper AI Voice to Text
12:44
Kevin Stratvert
Рет қаралды 513 М.
Fine-tuning Whisper to learn my Chinese dialect (Teochew)
28:10
Efficient NLP
Рет қаралды 8 М.
Understanding Speech Recognition using OpenAI's Whisper Model
38:50
How to use ChatGPT to learn ANY Language (new update)
13:26
Matt Brooks-Green
Рет қаралды 66 М.
Why Agent Frameworks Will Fail (and what to use instead)
19:21
Dave Ebbelaar
Рет қаралды 92 М.
Best FREE Speech to Text AI - Whisper AI
8:22
Kevin Stratvert
Рет қаралды 1 МЛН
Fine tuning Whisper for Speech Transcription
49:26
Trelis Research
Рет қаралды 26 М.
GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem
19:15
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
What are AI Agents?
12:29
IBM Technology
Рет қаралды 716 М.