Discussion post on Kaggle with brief summary of what I did in the video: www.kaggle.com/c/diabetic-retinopathy-detection/discussion/242755 Timestamps: 0:00 - Introduction 2:45 - Overview of DR and how to detect 9:11 - A look at the data 12:15 - Creating a baseline solution 24:22 - Result from baseline 26:25 - Idea #1: Preprocessing 34:07 - Result #1 35:19 - Idea #2: Loss Function 39:03 - Result #2 39:59 - Idea #3: Balanced Loader (skipped) 41:30 - Idea #4: Augmentation 42:52 - Result #4 43:44 - Idea #5: Using left and right information 51:03 - Result #5 51:50 - Idea #6: Increase image resolution 55:00 - Result #6 from various resolutions 55:10 - Final Result and Ending
@teddybest023 жыл бұрын
Thank you for the great ideas you gave us, I have an extra idea here. Can you show us how we can use Ben Graham(Competition winner) pre-processing method (specially color version) and train the model from different transfer learning models? I think this way the result will be improved more.
@saravanans98953 жыл бұрын
I legit learn more things from a single video of yours than any other source, Please keep making content like this bro.
@Msalaho12 жыл бұрын
hello, dealing with such large datasets, do i have to download this whole data? i mean did you downloaded > 80 GB of data to build your model ?
@mrdbourke3 жыл бұрын
Outstanding video Aladdin! Love the code explanations. Did you record the voice/code at the same time? I enjoyed the multiple lines of code reveal at a time.
@AladdinPersson3 жыл бұрын
I really appreciate you saying that, means a lot coming from you! Yeah maybe it looked fancy but I just removed the lines and pressed "u" for undo a bunch of times and tried to do some code commentary:)
@Amilakasun13 жыл бұрын
LOL you type the same code thousands of times.
@notimportant90183 жыл бұрын
Hi Aladdin, great video. Just had a doubt. Do you have videos where in you take us step by step through the code of these config, util, train, dataset files as in explaining how we could write such codes? If yes please do let me know that would be great. Thanks!
@DiogoSanti3 жыл бұрын
Hey Aladdin, i liked the approach in this video... Take a problem and dissecting it...Good job!
@DiogoSanti3 жыл бұрын
Would be great to make more content in this format...
@gurushreyaass94583 жыл бұрын
Excellent video hope this inspire many amateurs in this field .Thanks. Do post more videos.
@_arnav_chaudhuri8 ай бұрын
Great video sir ,Can you tell me where can i download the dataset of the images from ?
@mikhaeldito3 жыл бұрын
More videos like this, please!
@saketkattuboina66933 жыл бұрын
Can you also make a video on how to ensemble deep learning models??
@arunavamaulik193 жыл бұрын
Absolute brilliant series...more of this please!
@chjayakrishnajk Жыл бұрын
Please continue this series, it is soo good
@JuanGabrielOyolaCardonaАй бұрын
Thanks for sharing 😀👍 greetings from Colombia.
@yanononopon3 жыл бұрын
Hey Aladdin, great video ! It's a subject that's quite dear to my heart since my grandpa is slowly becoming blind from diabetes. Anyways, love your content :)
@Draculasius2 жыл бұрын
@13:32 How are you doing this command showing line by line? Is this a special feature? Because I think it is good for presentations.
@duongkstn9 ай бұрын
hey, instead of import sys and sys.extit(), you can type : "exit(0)" for short
@teddybest023 жыл бұрын
@Aladdin Persson Thank you for the great ideas you gave us, I have an extra idea here. Can you show us how we can use Ben Graham(Competition winner) pre-processing method (specially color version) and train the model from different transfer learning models? I think this way the result will be improved more.
@chainonsmanquants16303 жыл бұрын
12:15 Baseline easy solution to implement: a 2d convolutional layer in sequential mode with like 5 successive layers and kernels of size between 3x3 and 10x10 to account for the small size of the target. End the convolutional layer with flatten and dense with 5 units for 5 categories. How to do better... Well I'm not really sure
@iducater98823 ай бұрын
How many epochs did you run the train_blend? My score tends to be significantly lower than without combining left and right blending.
@pankajchand67612 жыл бұрын
Which deep learning framework do you feel is better for medical imaging applications like this? TF or PyTorch?
@danobdon38812 жыл бұрын
nice explanation... how can prepare this data for few shot learning? Any idea? would be very helpful.
@dishantdeshmukh53303 жыл бұрын
Showing subclassing model class you should implement call method in unet how and were to implement call function in unet
@abdelrahmanragab60443 жыл бұрын
Nice video Please mention to me where are you found the dataset that you used
@arunavamaulik193 жыл бұрын
#Suggestion When you talk about something that you have described in another video, can you put a link to it in the description? Thanks!
@MuhammadHussain-ws1xs3 жыл бұрын
Can you please do a similar video but for object detection with multiple objects in a single image (custom dataset), I can't seem to find any tutorial on custom object detection in pytorch
@inhibited444 ай бұрын
interesting. I am working with a data set that I resized to (32, 32) from (150, 150) because I saw someone do that in another video. I get a fit of .44 for very few images. I might try increasing resolution too and see what happens for the fit
@charlesnasamu11732 жыл бұрын
I have a quick question; After preprocessing the images and the size of each images changed, so why did you not compute the mean and std for the train transforms? @
@chainonsmanquants16303 жыл бұрын
Really instructive thank you !
@nikhilthapa9300 Жыл бұрын
Could you do a tutorial on multiple instance learning too please?
@erwanerwan61962 жыл бұрын
HI is it possible o do with TENSOR FLOW ??
@jeetshah46993 жыл бұрын
any reason as to why you haven't used transfer learning, like resent and then just changed the classifier???
@Gamma33 жыл бұрын
Amazing video! Thanks
@ashwinjayaprakash79913 жыл бұрын
This is golden! Looking at Aladdins solution: > He used Albumentations for transforms > Single dataset class for botht training and validation > used argmax as loss function > Using efficientnet > batch size 64, num epochs 100 > makes a baseline with 120 by 120 images, notes all validation values > tried preprocessing the image to trim the images > created modification to loss function > Getting a balnced dataloader > didnt work for him > heavy data augmentation > left and right image augmentation by blending - didnt understand. > image resolution increase Thing he didnt try: > training validation data > larger model > larger image resolution > Ensemble of CNN models > Tweaking loss values
@AladdinPersson3 жыл бұрын
Awesome summary. Regarding the usage of left and eye information, so each patient has left eye and right eye image, if we send information of both maybe the model can utilize this extra information. So I pretrained efficientnet using only single eye, then extract features from left and right conatenate those features and train a separate fc network on top of those.
@truonggianga2tk423 жыл бұрын
Thank you and wait your other Kaggle Video.
@chainonsmanquants16303 жыл бұрын
I think you could have had better results if you fusionned the images of left and right directly to train the model on. The result image would be showing the two eyes next to one another.
@odysseashlap3 жыл бұрын
Another idea: create synthesized ("fake") data created by a GAN-Unet architecture (similar to pix2pix) so as to increase the amount of training data
@teddybest023 жыл бұрын
Good idea
@Ayayron19983 жыл бұрын
Great subject choice!
@alanjohnstone87663 жыл бұрын
How/why did you choose to use Efficientnet as opposed to other architectures?
@AladdinPersson3 жыл бұрын
No particular reason really, it's performed well from my understanding on many benchmarks (imagenet) and I've coded it from scratch a while ago which makes me feel like I have a decent understanding of what's going on in the architecture
@theeFaris3 жыл бұрын
For augmentation, i think you should've skipped blur
@apurbasarkar69183 жыл бұрын
you solved that like a boss
@xin25162 жыл бұрын
If I understand it corrently, the input sizes for the efficientnet variants are fixed, like 224x224 for B0. So, I am confused about how can you increase the resolution for the imgs? Thx
@fasolya993 жыл бұрын
How long did the whole process took from you? beside the recording. Also worth to mention that I'm enjoying your videos and find them very helpful ♥️🌹
@AladdinPersson3 жыл бұрын
A few days?
@indianengineer5802 Жыл бұрын
you used full 88.29 GB data for trainning ???
@nova25773 жыл бұрын
With my admiration!
@frankrobert68672 жыл бұрын
Thumb up your tutorials.
@Murmur11313 жыл бұрын
Thanks!
@DaffaAmanu3 жыл бұрын
Amazing video! Though I have a problem when I tried running the dataset code on jupyter notebook, which is that it doesn't recognize the config.val_transforms attribute. Any solution to this?
@sayedathar25073 жыл бұрын
Lots of Love ❤️ Your videos are saviour ❤️❤️😍
@pareshkamble78883 жыл бұрын
Hi Aladdin, We also tried similar approach for DR classification. However, instead of directly resizing the images to 650x650, we identified and cropped the smallest square that encircle the circular DR image and resized to same size. Thereby, we were able to retain the aspect ratio. What do you suggest, do retaining aspect ratio have some advantage over regular resizing?
@ArpitYadav-ws5xe2 жыл бұрын
Excellent
@teddybest023 жыл бұрын
Keep it up!!!
@nguyenvietdung75882 жыл бұрын
where can I get the test_label.csv
@019_nishantrajsingh910 ай бұрын
can anyone please tell me how to get the validation data i.e. valLabels
@jeremymunroe10 ай бұрын
You can make your own😅
@joaobarreira5221 Жыл бұрын
in a similar situation, I trie to use JointsMSE Loss, I change MSE to RMSE and the model converged better and faster.
@sumanmahapatra88813 жыл бұрын
Nice content 👍 Can you please elaborate how can we use Kaggle GPU for deploying Deep learning models?
@virgilioespina3 жыл бұрын
Idol! What's up?
@ccuuttww3 жыл бұрын
It's all about the quality of data the images look terrible and they just mix the left eye and right eye together
@odysseashlap3 жыл бұрын
Lets goooo
@dome81163 жыл бұрын
May I ask how old you are sir?
@AladdinPersson3 жыл бұрын
yea I'm 23
@SUGATORAY3 жыл бұрын
Great video. Btw, how did you make the video to show line numbers as you developed (while also speeding up for some lines and pausing or running at 1x speed at times when necessary)? Am curious! These videos take an incredible amount of time to plan and make. Thank you, very much for the great presentation and the work you put behind these videos. 👏👏⭐️
@bajrangchapola67483 жыл бұрын
👍👍
@Dr_Shafi Жыл бұрын
please stop don't waste others time, a humble request