Audio Signal Processing for Machine Learning

  Рет қаралды 209,735

Valerio Velardo - The Sound of AI

Valerio Velardo - The Sound of AI

Күн бұрын

Пікірлер: 117
@araaudio
@araaudio 4 жыл бұрын
When I found your Channel I found a great treasure. When you choose a subject you are playing on my soul strings. Really I am grateful.
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Thanks a lot Ara!
@chukypedro818
@chukypedro818 3 жыл бұрын
It took me 3*10*8 secs to subscribe to your channel because you are a lifesaver. exactly what I needed for my side project at this point.
@normalperson1130
@normalperson1130 4 жыл бұрын
This is exactly what I was thinking of working on as a machine learning project. Thanks.
@tyhuffman5447
@tyhuffman5447 4 жыл бұрын
Love it! I'm a vibration analyst, we listen to machines. Thank you for putting this together.
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Glad you like it Ty!
@suyashramteke3588
@suyashramteke3588 4 жыл бұрын
I would like to thank you and appreciate for all the effort that you are taking to make these videos. I am a graduate student who is passionate about signal processing and machine learning in audio and your content is the best so far I have found. Since there are very less resources these are all the more helpful
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Thank you Suyash -- I'm glad you're finding my videos useful!
@NierChristopher
@NierChristopher 11 ай бұрын
Thanks very much for this series. Your code example and patient explanation really helped me a lot for getting started in the field of AI audio signal processing
@rafaelsetyan1755
@rafaelsetyan1755 3 жыл бұрын
I have just finished the series and I am telling you if you started, go until the end. Thanks a lot Valerio
@D12075
@D12075 2 жыл бұрын
Wow, thank you for putting this together! Though I'm a musician, I don't have a background in audio signal processing and it's been a struggle to find a good compiled source of information for deep learning specific to audio domain problems.
@vadimshatov9935
@vadimshatov9935 Жыл бұрын
I just found an incredible KZbin channel. Thank you so much!
@proyectosinformatica2
@proyectosinformatica2 4 жыл бұрын
Wow, I was watching the main series of Audio classification with Python you made. And when you explained the MFCCs passing as input to a CNN in your example you put this [100, 13, 1] as an input shape and you said the 100 came from the samples in the audio file (51200) / the hop length of 512 and I was about to ask where did the 51200 come from. Maybe with this series I'll be able to understand that and also the Mel Spectrogram which is something that I wanted to know about for the longest time from a non-so mathematical perspective, but rather a more code applicable one. Greetings to you and thank you very much
@BirnieMac1
@BirnieMac1 11 ай бұрын
Thank you for this series chief; I'd been looking at trying to build a detection algorithm using a fourier transform to produce spectrograms and the theory refresher has been super super useful Plus confirmation that I wasn't barking up the wrong tree into how to approach it is always nice
@BalamurugaMuthumani
@BalamurugaMuthumani 4 жыл бұрын
Very much excited for this series
@smilebig3884
@smilebig3884 4 жыл бұрын
I was desperately waiting for this series. 😀
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Glad to hear that!
@TheElfurio
@TheElfurio 3 жыл бұрын
Thanks a lot for your great series, Valerio! This helps me a lot with my current signal processing project.
@anthonychianain2241
@anthonychianain2241 2 ай бұрын
Thank you very much for the course... have been looking for something like this for a long time
@frostvision322
@frostvision322 4 жыл бұрын
Very excited for this Valerio! Your videos have helped me immensely at understanding audio and ML/DL in general. Quick question: Do you have any recommendations or resources or plan to contribute some discussion about adding environmental noise to audio? I would love to talk about data augmenting with noise / other sound files into existing signals with proper RMS and SNR as to not overpower the desired signal. I'd love to see how to add radio static / random noises into audio for purpose of simulating target radio speech data. Thanks for everything!
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
I don't think I'll cover data augmentation in this series. Unfortunately, I don't a have a reference for noise adding. However, you can find papers which tangentially talk about it. I'm thinking of papers which, for example, introduce denoising autoencoder architectures.
@frapastique
@frapastique 4 жыл бұрын
Awesome, you’re covering the topics that I need for my actual project. Thank you and keep it up
@DylanTallchief
@DylanTallchief 4 жыл бұрын
Awesome, can't wait :)
@danji9485
@danji9485 3 жыл бұрын
I found you
@skyscape2087
@skyscape2087 2 жыл бұрын
tf you doin here dylan?
@BorisGrishenco
@BorisGrishenco 2 жыл бұрын
You are amazing!. Super interesting topics including regarding how to publish a paper.
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 2 жыл бұрын
Thanks :)
@shubh4793
@shubh4793 4 жыл бұрын
I am working on a project and man this is reall helpful. Thankew so much
@hamidmojarrad
@hamidmojarrad 7 ай бұрын
Your channel sounds exactly what I was looking for as an ASR researcher. Thank you for these informative videos. Do you have tutorials on Montreal Forced Alignment too?
@islamicinterestofficial
@islamicinterestofficial 4 жыл бұрын
Waiting for next videos.....
@Crytoma
@Crytoma 3 жыл бұрын
Can't thank you enough for this series.
@sidalibourenane5377
@sidalibourenane5377 Жыл бұрын
Hey Mr Hope you doing good ! Please Can you help me ? How Can we use speech recognition to detect falling in elderly people ? Just another question how to combine audio with image to implement fall detection ?? Thank you
@ManontheBroadcast
@ManontheBroadcast 4 жыл бұрын
Love it already! ...looking forward to the next video...
@wildananugrah
@wildananugrah 2 жыл бұрын
what a great content, i really need this series, thank you for your effort to make this series
@chahinezhigoun1078
@chahinezhigoun1078 4 жыл бұрын
Very interesting Can't wait for it
@ahmeterdonmez9195
@ahmeterdonmez9195 3 ай бұрын
A mine of information I found by chance 💪
@muratcan__22
@muratcan__22 4 жыл бұрын
Thanks for these series. This is what I needed project that I had in my mind. Please don’t pull yourself back while going deep on theory :) I like to learn the theory along with practical examples.
@torshamondal8983
@torshamondal8983 4 жыл бұрын
Can you please do a video on speech emotion recognition...I have already seen your speech recognition and that is not enough for emotion classification...but a good one
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Hi Torsha, I may cover speech emotion recognition in the future. The closest problem I've tackled on this channel is genre classification n my DL for Audio series. You can re-use most of those concepts for your problem.
@KordTaylor
@KordTaylor 8 ай бұрын
Really a great resource here. Thank you! 👏🏻👏🏻👏🏻
@sankalpbhamare3759
@sankalpbhamare3759 2 жыл бұрын
Thank you so much for the series they are really amazing and organized!!
@jamalan7417
@jamalan7417 Жыл бұрын
new to the subject here with some (limited) dsp knowledge. Which of your playlists should i start with ?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI Жыл бұрын
This one is a great start.
@apidas
@apidas Ай бұрын
thank you Valerio, a lot of knowledge in this series. I'd donate superchat to this video but you don't have them enabled.
@junjiesang
@junjiesang Жыл бұрын
I want to know about how to do Classification of heart sounds and Improve accuracy. Is the method used for heart sound classification the same as for audio classification?
@CarliCode
@CarliCode 4 жыл бұрын
Thank you so much! This is very helpful to my project degree
@irtsamghazi606
@irtsamghazi606 2 жыл бұрын
This is an awesome series!
@akshaya3086
@akshaya3086 2 жыл бұрын
Please cover Discrete Wavelet transform (DWT) feature extraction method
@junjiesang
@junjiesang Жыл бұрын
I want to know about how to do Classification of heart sounds and Improve accuracy
@shell923shock2
@shell923shock2 4 жыл бұрын
Can't wait for the next one!
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
It'll be next week, first thing on Monday :)
@pyaephyo3633
@pyaephyo3633 Жыл бұрын
Could you please share about speech analyzer like elsa ? I am waiting for it,sir.
@bytecauldron
@bytecauldron 3 жыл бұрын
If I could like a video twice, I would.
@amrousimen7170
@amrousimen7170 3 жыл бұрын
it seems an execellent contente, i will start your course, i am really motivated and excited, thanks a lot
@anshulnayak9412
@anshulnayak9412 4 жыл бұрын
Hi Valerio, I am planning to extract feature from a neural series like EMG or EEG. Can the same techniques be applied for neural series signals?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Yes, several of the techniques I discuss can be applied to EMGs/EEGs.
@durgaganesh423
@durgaganesh423 2 жыл бұрын
Hi Could you help me How to find abnormalities from audio file Is it possible from ML?
@nerox8580
@nerox8580 2 жыл бұрын
Hello, thank you for the tutorial. I am using java for developping android Apps and I am searching for an android library for sound comparison. I have tried "musicg" library but it gives bad results. Do you know any alternative ? thank you again
@marinachau5359
@marinachau5359 3 жыл бұрын
Thank you so much for this video!!!
@explorerars4208
@explorerars4208 2 ай бұрын
I subscribed you when you said i love python
@georgesmith281
@georgesmith281 4 жыл бұрын
Amazing content and so well explained!! how can i implement this knowledge in speaker verification? I already followed the previous videos and created a project which so far tries to identify the spoken user but despite the fact that im getting 1.00 accuracy the system still fails to identify correctly. Can anyone help?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Hi George, thank you! Serendipitously, a thread on speaker verification just came out on TheSoundofAI Slack community in the #advice channel. I suggest you go check that out and join the conversation ;)
@soumyadrip
@soumyadrip 4 жыл бұрын
💕💕💕 Thank you for the playlist.
@sabrinahuda7308
@sabrinahuda7308 3 жыл бұрын
YOU SAVED MY FYP !
@maxlambiel
@maxlambiel Жыл бұрын
Do you know if AI could make polyphonic audio to MIDI faster and better? And what about the restoration of very old and mangled recordings? I really want to try to work on this if I can but it feels like a steep curve.
@vandaliztik9266
@vandaliztik9266 3 жыл бұрын
what are the prerequisite fundamental courses for this series video?
@omkarspowar7500
@omkarspowar7500 4 жыл бұрын
Please can u also add a video on basic sound detection and algorithm used to do that...
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
This series will mainly be focused on audio DSP. You can check out my series on DL for Audio to get an idea of some of the architectures you can use for audio classification.
@aussieronnied
@aussieronnied 4 жыл бұрын
Excellent content, keep it up! :D
@arbteampraetorians3452
@arbteampraetorians3452 3 жыл бұрын
i like this series before that i watched hhhhhhh nice introduction dude
@Matter743
@Matter743 4 жыл бұрын
great playlist!! .....can you suggest me like how toapply wavelet transform to sudio signal using python ?? is there any way ?
@shivrankrishen
@shivrankrishen 3 жыл бұрын
Where my fellow Electrical Engineers at?
@ptwnight9326
@ptwnight9326 5 ай бұрын
Hereeeeeeeeeeeee
@giuseppemagistro7733
@giuseppemagistro7733 3 жыл бұрын
Penso che tu sia italiano. Io sono un linguista, mi occupo di fonologia. Grazie per il tuo lavoro!
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 3 жыл бұрын
Grazie Giuseppe!
@tombalvin6344
@tombalvin6344 Жыл бұрын
Hey brotha is there a way I can send you this audio I captured and can you tell me what’s really going on
@haohuynhnhat3881
@haohuynhnhat3881 3 жыл бұрын
Great stuff, thank you for your effort.
@kushalgalipally3510
@kushalgalipally3510 Жыл бұрын
We are trying to develop a robot , can we use this topic for understanding how to develop algorithm for the robot to listen to un natural sounds etc (not speech recognition)
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI Жыл бұрын
Yes, you'll find the foundations for what you need here.
@arunmehta8234
@arunmehta8234 3 жыл бұрын
We are working for an project Covid 19 and acoustics. Do you have any suggestions for us?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 3 жыл бұрын
You can check out this video I made on the topic kzbin.info/www/bejne/iZzdpqmXaMibf68
@shayanthrn
@shayanthrn 3 жыл бұрын
Good luck sir
@amilkarherrera9804
@amilkarherrera9804 4 жыл бұрын
This is great content! Thank you very much. Is there a book that you would advise someone who wants to learn signal processing for time series data?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Thanks! Sorry, but I'm not aware of any such book. If you'd like to get a few ideas about general audio / music signal processing, you can check out this video where I suggest a few books on the topic kzbin.info/www/bejne/oJOYd2eGm9qFqNk Some of the ideas you'll learn there, can be re-used for time series data.
@techsambd4058
@techsambd4058 4 жыл бұрын
can you make Audio classification using efficientNet. it would be better for us to understand efficient net
@subramanyabhattm4626
@subramanyabhattm4626 3 жыл бұрын
Is there any udemy or coursera or edureka courses on audio files processing? Suggestion please
@ashwanirathee508
@ashwanirathee508 4 жыл бұрын
This is good stuff 👍
@WannabePianistSurya
@WannabePianistSurya 4 жыл бұрын
Can you talk about Google Magenta's Differentiable Digital Signal Processing and how it works? I think it is too underrated tbh.
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
That's an amazing application. I won't touch on it in this series, but definitely cover the subject in a video at some point.
@MsBalajiv
@MsBalajiv 4 жыл бұрын
Hi... Do you have a similar course material for learning speech recognition using Hidden Markov Models (HMM)?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
No, I don't.
@swapnilbhabal5289
@swapnilbhabal5289 3 жыл бұрын
one question, is calculating human singing potential is feasible by use of deep learning ??
@swapnilbhabal5289
@swapnilbhabal5289 3 жыл бұрын
To simplify this more --> The model will extract feature and classify the audio in one of the three groups whether its [Good,Bad,AVG] i would like to know is making such kinda model is feasible ????
@xiangli1133
@xiangli1133 2 жыл бұрын
Thanks, brother, for your generousity! You deserve more than a subscription. I'm from a computer vision background your tutorial helped me a lot! Yet I can't able to join the soundofai community. Could you check the problem?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 2 жыл бұрын
I've tried out the link, It seems to work for me. Can you let me know what kind of issue you're experiencing?
@naim2083
@naim2083 3 ай бұрын
Hello are the next videos of the playlists obsolete?
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 3 ай бұрын
No.
@naim2083
@naim2083 3 ай бұрын
@@ValerioVelardoTheSoundofAI thanks you for the free courses
@saumya3470
@saumya3470 4 жыл бұрын
can you make a tutorial for Kaldi
@rahulbpillai22
@rahulbpillai22 4 жыл бұрын
please can you make a tutorial series on speech synthesis using cnn and thank you for this wonderful content.
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Thanks! I've just started a series on sound generation with variational autoencoders.
@alexkokh1501
@alexkokh1501 2 жыл бұрын
How could i get in touch with you?
@catlord69
@catlord69 3 жыл бұрын
you are *amazing* !
@lulululu4912
@lulululu4912 4 жыл бұрын
Hello, very interesting subject and video. I have an hugely important request to adress you. Is it possible to train an A.I so that it can compare a reference signal (input source signal) with an output signal (that is the modified result of the source signal) so it can calculate an corresponding signal for the output signal to be the same as the input signal? It may not be super clear but basically, the functional chain is: -an input signal that is a source of audio content -an output signal that is the audio content (input signal) but processed in a way that distord it temporally (out of phase for some part of the signal) and in amplitude. -a closed feedback loop that send the exact copy of the distorded output signal to a core processing unit that contain the ai. This feedback loop sits right in between the input signal and output signal so that the AI can align (by injecting a new signal to the input signal, a compensating signal) almost immediately the output signal with the input. I'm a student and hobbyist and your reply could really be a greatly appreciated and precious contribution to my project Thanks
@codecomedytv1998
@codecomedytv1998 4 жыл бұрын
Just 👏in 👏time 👏
@凌璃-b8z
@凌璃-b8z 3 жыл бұрын
love it
@mimikoko4299
@mimikoko4299 4 жыл бұрын
Thanks alot
@akashdhage
@akashdhage 4 жыл бұрын
I have tried signing in thesoundofai.slack.com ,however got reply as email id not exist with slack.com
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI 4 жыл бұрын
Can you try this link? kzbin.info?event=video_description&v=iCwMQJnKk2c&q=https%3A%2F%2Fjoin.slack.com%2Ft%2Fthesoundofai%2Fshared_invite%2Fzt-f71npumr-anli6W4QCuZ8UCj2gLoBkw&redir_token=QUFFLUhqbjBEMUY2clB3UDhsZklLN29feEdaN1h0RXBFd3xBQ3Jtc0tsVHFzaXFtT2hvRC12eEYwRVdNSDFyOWJQb2J2ZGw3SnhIOXY0UmEwY3FKNmFXZ3R5REpvaFVHbXRhVVFndWJRWjdXYV9rdkdIRUljWTRXckhlV19vVVQwZ1VSSEJ2VFk2SFBTT0hTamh1SG5wUkRZbw%3D%3D
@NikitaBeekle
@NikitaBeekle Ай бұрын
please send code for this
@ValerioVelardoTheSoundofAI
@ValerioVelardoTheSoundofAI Ай бұрын
Check the description box.
@Bihari_Chaman
@Bihari_Chaman 2 жыл бұрын
You do not work with MATLAB? I'm not using machine learning. Will this help me?
@cravinadventure
@cravinadventure 3 жыл бұрын
GOLD MINE
@Ruhgtfo
@Ruhgtfo 3 жыл бұрын
cool
Sound and Waveforms
26:53
Valerio Velardo - The Sound of AI
Рет қаралды 88 М.
Mel Spectrograms Explained Easily
30:31
Valerio Velardo - The Sound of AI
Рет қаралды 101 М.
I thought one thing and the truth is something else 😂
00:34
عائلة ابو رعد Abo Raad family
Рет қаралды 11 МЛН
Confronting Ronaldo
00:21
MrBeast
Рет қаралды 21 МЛН
А я думаю что за звук такой знакомый? 😂😂😂
00:15
Денис Кукояка
Рет қаралды 5 МЛН
Intensity, Loudness, and Timbre
37:14
Valerio Velardo - The Sound of AI
Рет қаралды 60 М.
Understanding Audio Signals for Machine Learning
25:16
Valerio Velardo - The Sound of AI
Рет қаралды 60 М.
Audio Data Processing in Python
19:52
Rob Mulla
Рет қаралды 170 М.
Types of Audio Features for Machine Learning
22:42
Valerio Velardo - The Sound of AI
Рет қаралды 70 М.
Demystifying the Fourier Transform: The Intuition
37:17
Valerio Velardo - The Sound of AI
Рет қаралды 43 М.
Mel-Frequency Cepstral Coefficients Explained Easily
57:43
Valerio Velardo - The Sound of AI
Рет қаралды 130 М.
Understanding Time Domain Audio Features
19:41
Valerio Velardo - The Sound of AI
Рет қаралды 48 М.
How I’d learn ML in 2024 (if I could start over)
7:05
Boris Meinardus
Рет қаралды 1,2 МЛН
Short-Time Fourier Transform Explained Easily
34:47
Valerio Velardo - The Sound of AI
Рет қаралды 78 М.
Learn Machine Learning Like a GENIUS and Not Waste Time
15:03
Infinite Codes
Рет қаралды 116 М.
I thought one thing and the truth is something else 😂
00:34
عائلة ابو رعد Abo Raad family
Рет қаралды 11 МЛН