Speech Recognition in Python | finetune wav2vec2 model for a custom ASR model

Views: 11,763

Python Lessons

Days ago

Comments: 33

@infinitewebrevolution 9 months ago

Thank you so much, sir, for your hard work and the pretrained model. It has helped me a lot, and I will always be grateful.

@PyLessons 9 months ago

Glad to hear that! You are welcome

@filofilo9695 A month ago

Why do you use batch size = 1? Is it just to save RAM? Great tutorial, thank you!❤

@hugok6212 10 months ago

Excellent video and explanation. I have a question: if I train a model this way, can I use it for speech recognition in real time? Thank you

@PyLessons 10 months ago

Hey, yes and no. It depends on what hardware you'll run the model on (CPU, GPU or other) and on your "real time" requirements. You need to test it and you'll see :)
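One common way to run that test is to measure the real-time factor: processing time divided by the duration of the audio, where a value below 1 means the model keeps up with incoming speech. The sketch below is only an illustration; `dummy_transcribe` is a hypothetical stand-in for whatever function wraps the trained model, and the 16000-sample clip is an assumed one second of audio.

```python
import time


def real_time_factor(transcribe, audio, audio_seconds, runs=3):
    """Return the best (processing time / audio duration) ratio over several runs."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    best = float("inf")
    for _ in range(runs):
        start = time.perf_counter()
        transcribe(audio)
        best = min(best, time.perf_counter() - start)
    return best / audio_seconds


def dummy_transcribe(audio):
    # Hypothetical placeholder: swap in the call that runs your own model.
    return "".join("a" for _ in audio)


# Assumed input: one second of silence at 16 kHz.
rtf = real_time_factor(dummy_transcribe, [0.0] * 16000, audio_seconds=1.0)
print(f"real-time factor: {rtf:.4f} (below 1.0 means faster than real time)")
```

Measure on the target hardware with clips of realistic length, since the ratio can change with both.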

@SagarPatel-h1g A month ago

After training and testing, how can we use this ONNX model for a speech-to-text task?
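The thread does not answer this, and the exact inputs and outputs of the exported model are not stated here. Assuming the model was trained with CTC and returns one row of scores per audio frame, the step after running it is typically greedy CTC decoding: take the best class per frame, collapse repeats, and drop the blank token. The sketch below covers only that decoding step; the vocabulary and the blank index are made-up values, not taken from the tutorial.

```python
def ctc_greedy_decode(frame_scores, vocab, blank_index):
    """Greedy CTC decoding: argmax per frame, collapse repeats, remove blanks."""
    best_ids = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores
    ]
    chars = []
    previous = None
    for idx in best_ids:
        # Keep a symbol only when it differs from the previous frame
        # and is not the blank token.
        if idx != previous and idx != blank_index:
            chars.append(vocab[idx])
        previous = idx
    return "".join(chars)


def one_hot(index, size):
    # Helper that builds a fake score row for the example below.
    return [1.0 if i == index else 0.0 for i in range(size)]


# Assumed toy setup: three characters plus a blank at index 3.
vocab = ["a", "b", "c"]
blank_index = 3
frames = [one_hot(i, 4) for i in [0, 0, 3, 0, 1, 1, 3, 2]]
print(ctc_greedy_decode(frames, vocab, blank_index))  # prints "aabc"
```

The blank between the second and third frames of "a" is what lets the decoder keep both letters instead of merging them.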

@shafiqrhmankeliwall8019 9 months ago

Hi, great job, keep it up! I have one question: I want to build and train a model for a low-resource language such as Pashto, and I will make the dataset from scratch. Any idea how to start, or any useful links? Thanks

@PyLessons 9 months ago

Thanks! I do not recommend making a dataset from scratch on your own; I believe you should be able to find something open source. I don't have a dataset, but check my dataset structure and you'll see what format is required.

@BrightShoko-m7c 10 months ago

Good job👏... but I'm getting errors on the onnx installation. What Python version did you use?

@PyLessons 10 months ago

I used Python 3.10. What error do you get? Often it is related to the protobuf version.

@konami_cheater A month ago

Can you give me the link to the paper? 0:57

@victormessias107 9 months ago

When I'm training, it freezes at the end of the first epoch. Any idea?

@PyLessons 9 months ago

It shouldn't be like that, so try to debug it. For example, iterate through the training data provider and the validation data provider (for example, "for data in data_provider") and check whether each one can reach the end. If you still face this issue, open an issue on GitHub with more details.
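That check can be sketched as a small helper that walks a provider once, prints progress so a stall is visible, and reports the batch index if an exception is raised. It assumes only that the data provider is iterable, as the "for data in data_provider" hint implies; the list used in the example is a hypothetical stand-in for the real training and validation providers.

```python
def check_provider(data_provider, name="provider", report_every=100):
    """Iterate once over a data provider and return how many batches it yields."""
    count = 0
    try:
        for _ in data_provider:
            count += 1
            if count % report_every == 0:
                # Flush so the last line printed shows where a hang starts.
                print(f"{name}: {count} batches read", flush=True)
    except Exception as error:
        print(f"{name}: failed after {count} batches: {error!r}")
        raise
    print(f"{name}: reached the end after {count} batches")
    return count


# Hypothetical stand-in data: three (audio, label) pairs.
fake_batches = [("audio_0", "label_0"), ("audio_1", "label_1"), ("audio_2", "label_2")]
check_provider(fake_batches, name="train", report_every=2)
```

If the loop finishes for the training provider but never finishes for the validation provider, the freeze is in loading the validation data.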

@AmitYadav-rp3ot A year ago

Hi there, great video! I wanted to know your opinion on training a model like this just for recognising numbers and a couple of words from an audio file. Will such custom training help to reduce the size of the model? I want to create a very small model so that I can run it on a CPU with a sub-GHz clock. Please share what you think. Many thanks

@PyLessons A year ago

Hi, thanks! No, training a model on simpler data doesn't reduce the model size. Check my other videos on creating your own custom model for simpler data, such as numbers and words. But if your set of words is small, maybe you should consider treating it as a classification task. Also, to reduce the size of the model, look into quantization and pruning techniques.
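To make those two techniques concrete, here is a toy illustration on a plain list of weights; it is not how any particular framework implements them. Quantization stores each weight as a small integer plus one shared scale (an int8 value takes 1 byte instead of 4 for float32), and magnitude pruning sets the smallest weights to zero. The weight values are made up for the example.

```python
def quantize_int8(weights):
    """Symmetric quantization: floats become integers in [-127, 127] plus a scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    return [round(w / scale) for w in weights], scale


def dequantize(q_weights, scale):
    """Recover approximate float weights from the integers and the scale."""
    return [q * scale for q in q_weights]


def prune_by_magnitude(weights, fraction):
    """Zero out the given fraction of weights with the smallest absolute value."""
    n_prune = int(len(weights) * fraction)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


# Made-up weights for illustration only.
weights = [0.52, -1.27, 0.03, 0.9, -0.04, 0.31]
q_weights, scale = quantize_int8(weights)
restored = dequantize(q_weights, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print("quantized:", q_weights, "scale:", scale, "max error:", max_error)
print("pruned:", prune_by_magnitude(weights, fraction=0.5))
```

Both steps trade some accuracy for size, so the word error rate should be measured again after applying them.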

@maimunahmaskur7525 7 months ago

It's great code! Could you please help? If I want to use this code for a dataset labeled with phonemes and use PER (Phoneme Error Rate) for testing and validation, what should I do? I mean, which parts of the code do I need to adjust? Thank you!

@PyLessons 6 months ago

I am not familiar with PER, so I can't tell you
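For reference, PER is usually computed the same way as word or character error rate, only over phoneme sequences: the edit distance (substitutions, deletions, insertions) between the predicted and reference phonemes, divided by the reference length. The sketch below is a plain-Python version of that metric under this assumption; it does not show which parts of the tutorial code would need to change, and the phoneme labels are example values.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two sequences of tokens."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        current = [i]
        for j, hyp_item in enumerate(hypothesis, start=1):
            cost = 0 if ref_item == hyp_item else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def phoneme_error_rate(reference, hypothesis):
    """Edit distance divided by the number of reference phonemes."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Example labels only: one phoneme ("L") is missing from the prediction.
reference = ["HH", "AH", "L", "OW"]
hypothesis = ["HH", "AH", "OW"]
print(phoneme_error_rate(reference, hypothesis))  # prints 0.25
```

The inputs are lists of phoneme labels rather than strings, so multi-character phonemes such as "HH" count as one unit.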

@djrocks5678 A year ago

Hi there! Thanks a lot for this. I wanted to ask you: I am working on a desktop voice assistant project as part of my university work, and I want to train my own speech recognition model. How would I go about this? I looked at datasets, and something like Mozilla's 79 GB of data is too much for my needs, so I was wondering how I'd go about making a smaller-scale speech recognition model for my project.

@PyLessons A year ago

Hi, usually it's impossible to get great results without huge datasets and GPU computing. But you may try to create a custom ASR model with another tutorial of mine, which you can check here: kzbin.info/www/bejne/bpjCZX99Z9GjiJo. Also, there are a lot of trained ASR models that you usually only need to integrate (just an idea).

@BASDOURI 8 months ago

Your contact, please?

@N3ONGNCS 6 months ago

I want to create an ASR model for an African vernacular/local language. Could I use this for that? I'll create my own dataset if need be. Or what would you suggest? I'm attempting this for the first time and am a little lost and overwhelmed.

@daisy-bot-py A month ago

I'm working on a similar project and I'm just curious to know whether you trained the model on an African language?