OpenAI Whisper model: ASR for many languages AND other languages to English translation model

  Рет қаралды 2,922

Prabhjot Gosal

Prabhjot Gosal

Күн бұрын

Пікірлер: 10
@dr-bijay-kumar-singh
@dr-bijay-kumar-singh 6 ай бұрын
I got no error message, but at "print(len(transcripts_all_NoFall))" the output is zero. Although I have kept 10 English audio samples in the directory. Because of this when I proceed further the output for WER is empty results without any error message. Can you share how you kept the audio samples in your directory. I am using your code on colab.
@prabhjotgosal2489
@prabhjotgosal2489 6 ай бұрын
Hi - The audio samples are simply kept as .wav files in the directory that I am reading from. This directory exists in my Google drive. There are few ways to help root cause your issue further. 1. Before you do any ASR with Whisper.. check if audios are properly read. You could use librosa.load( ) to do so.. Check the output of the librosa.load() to see if the values in the audio files are non-zero. 2. Make sure the audio files contain speech. If there is no speech detected by the Whisper model in the audio, it will output nothing for the transcripts.
@ABAnuSaraReality
@ABAnuSaraReality 7 ай бұрын
Hi, i have doubt . I uploaded the files in google collab via google drive , but the upload files and folders are not displaying even i auth the google drive with google collab , it doesn't show. Is it ok to use vscode as i am familar with the vscode functionality? Or is it compulsory to submit via google collab or can i upload the vs code executed files in google collab ?
@prabhjotgosal2489
@prabhjotgosal2489 7 ай бұрын
Hi - What error do you get when you try to run the code in colab? I am not sure I understand what you mean by, "the vs code executed files in google collab", Are you refereeing to audio files or something else?
@ABAnuSaraReality
@ABAnuSaraReality 7 ай бұрын
@@prabhjotgosal2489 I am referring about mounting the whole project in google collab. but i can't properly auth the google drive in google collab and mount them.
@prabhjotgosal2489
@prabhjotgosal2489 7 ай бұрын
@@ABAnuSaraReality You can run the entire code in VScode locally on your machine. You may have to adjust some syntax and ofcourse the filepath. I suggest creating a new file in VScode and copy/paste sections of code from my file little by little, rather than running the original file all at once. It will make debugging and understanding the code easier.
@lavkushdas5529
@lavkushdas5529 Жыл бұрын
hey getting error as " FileNotFoundError: [Errno 2] No such file or directory: '/content/drive/My Drive/Whisper_Test/NoFall' " how to resolve?
@prabhjotgosal2489
@prabhjotgosal2489 Жыл бұрын
Hi.. you will need to adjust the file path.. Change it to wherever the audio file you are trying to process is located on your machine or Google drive if you are using colab.
@CricketExpress-hx4mh
@CricketExpress-hx4mh Жыл бұрын
@@prabhjotgosal2489 hey can you provide the test audio for this?
@jatinjoshi7549
@jatinjoshi7549 4 ай бұрын
can i get the dataset used by you
Conformer: Convolution-augmented Transformer for Speech Recognition #nlp
12:47
Why Agent Frameworks Will Fail (and what to use instead)
19:21
Dave Ebbelaar
Рет қаралды 92 М.
Do you love Blackpink?🖤🩷
00:23
Karina
Рет қаралды 22 МЛН
Thank you Santa
00:13
Nadir Show
Рет қаралды 41 МЛН
Why no RONALDO?! 🤔⚽️
00:28
Celine Dept
Рет қаралды 91 МЛН
How many people are in the changing room? #devil #lilith #funny #shorts
00:39
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 3,8 МЛН
Microsoft Just Showed Us How To Use New AI Agents...
13:56
TheAIGRID
Рет қаралды 82 М.
I Forked Bolt.new AI Code Editor and made it way better @ColeMedin
16:47
The Metaverse Guy
Рет қаралды 7 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
new critical linux exploit has been hiding for 10 years.
9:32
Low Level
Рет қаралды 150 М.
ChatGPT FINALLY Sounds Human & More AI Use Cases
18:31
The AI Advantage
Рет қаралды 27 М.
15 INSANE Use Cases for NEW Claude Sonnet 3.5! (Outperforms GPT-4o)
28:54
Think Fast, Talk Smart: Communication Techniques
58:20
Stanford Graduate School of Business
Рет қаралды 42 МЛН
GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem
19:15
Do you love Blackpink?🖤🩷
00:23
Karina
Рет қаралды 22 МЛН