Feature Pyramid Network | Neck | Essentials of Object Detection

  Рет қаралды 13,567

Kapil Sachdeva

Kapil Sachdeva

Күн бұрын

This tutorial explains the purpose of the neck component in the object detection neural networks. In this video, I explain the architecture that was specified in Feature Pyramid Network paper.
Link to the paper [Feature Pyramid Network for object detection]
arxiv.org/abs/...
The code snippets and full module implementation can be found in this colab notebook:
colab.research...
The torchvision has a more flexible implementation which would take more than 3 feature layers from backbone
pytorch.org/vi...

Пікірлер: 65
@paedrufernando2351
@paedrufernando2351 Жыл бұрын
Keep the pearls of wisdom dropping sir..Privilage to learn from you miles across...
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏 thanks for the kind words.
@lostpenguin3682
@lostpenguin3682 9 ай бұрын
very helpful! I really like that you're explaining it with an example with concrete numbers!
@KapilSachdeva
@KapilSachdeva 9 ай бұрын
🙏
@AkhileshShukla-d5x
@AkhileshShukla-d5x Жыл бұрын
Sir, I have a lot of to say after finding your video on KZbin but just ❤ , respect and thank you. 🙏🙏
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@ianhowe8881
@ianhowe8881 Ай бұрын
Incredible explanatory skills!
@brunodias3524
@brunodias3524 Жыл бұрын
I am so happy I found this video. Really good content!
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@TeamDman
@TeamDman Жыл бұрын
Thank you for sharing your knowledge!
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@vipingautam9501
@vipingautam9501 Жыл бұрын
This is excellent! I just love it.
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@NehadHirmiz
@NehadHirmiz Жыл бұрын
Excellent tutorial. Thank you very much.
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@applestarpie
@applestarpie Жыл бұрын
I like your videos, which are easy and fun to learn. Thanks a lot!
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@ranjithtevnan2909
@ranjithtevnan2909 3 ай бұрын
I have 2 questions. How are the 1X1 and 3X3 CNN used trained to obtain the weight parameters? Also shouldn't 3X3 with stride 1 change the dimension, though it keeps the number of channels the same the size of the output feature would have changed and reduced by 2
@dmgeo
@dmgeo 2 ай бұрын
How is this different from U-net? I think they're pretty similar if you think that in the U-net you're going down in the encoder, up in the decoder and sideways with the skip connections. It's like an upside-down U-net
@science.20246
@science.20246 7 ай бұрын
is useful to add channel and spatial attention in conv layers to improve
@rampavanmedipelli6152
@rampavanmedipelli6152 Жыл бұрын
Thank you... excellent clarity... please try to make a tutorial on anchor free detectors like FCOS..
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏 yup. First need to implement it :)
@yogeshwarshendye4857
@yogeshwarshendye4857 6 ай бұрын
If done with UNet, it won't require upsampling as we concatenate the layers right?
@krishnachaitanya7374
@krishnachaitanya7374 Жыл бұрын
This is quite informative and helpful. Can you please create a video on prediction heads in fpn as in how to assign a predicted bbox to a particular feature map. That would be quite helpful.
@KapilSachdeva
@KapilSachdeva Жыл бұрын
Yes, thinking to make some videos about different label assignment techniques. Now about your question - the right terminology or phrasing of your request would be how to assign an anchor box to a particular feature map.
@AdnanMunirkhokhar
@AdnanMunirkhokhar Жыл бұрын
amazing explanation Dr.
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@kylehuang9035
@kylehuang9035 Жыл бұрын
Could you give a tutorial of diffusing model to your VAE series? Its related and would like to see your explanation!
@KapilSachdeva
@KapilSachdeva Жыл бұрын
Though I understand the theory it’s just that I have never implemented/used them myself. I prefer to share those concepts that I have implemented myself and applied on some real world problem. But not saying no :) maybe one day. Thanks for the ask though.
@vincentpelletier1246
@vincentpelletier1246 6 ай бұрын
I don't know if I got this wrong but if I take a 1x64x26x26 feature through a convolution that has a K=3 and S=1, I will definitely not end up with a 1x64x26x26, but with a 1x64x24x24. To achieve the desired shape would require a P=1. If I'm not correct, would someone please explain how the dimensions would work in this case?
@abhishekdhiman5719
@abhishekdhiman5719 3 ай бұрын
Thanks for sharing the knowledge
@rampavan4094
@rampavan4094 Жыл бұрын
Could you give a tutorial on the vision transformer model for object detection?
@KapilSachdeva
@KapilSachdeva Жыл бұрын
in some time. have been preoccupied with some stuff but would try my best
@user-uf3md5ub5j
@user-uf3md5ub5j Жыл бұрын
Thanks a lot! would be the following videos soon?
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏 yes.
@DIAHAYUNINGTYASWATI
@DIAHAYUNINGTYASWATI 11 ай бұрын
Do you know how to combine AFPN with the YOLO v8 algorithm? If you know, please tell me. Thanks
@LongLeNgoc-qq5qn
@LongLeNgoc-qq5qn Жыл бұрын
what about height and width are odd number (415), sir? In that case, the size after conv and after upsample is miss match. How to fix that, please!
@KapilSachdeva
@KapilSachdeva Жыл бұрын
Resize the image to 416 or any other size (e.g. 640) before feeding it to the network.
@cheeziobodini
@cheeziobodini Жыл бұрын
Instead of doing the upsampling via pytorch module and being angry about it, would it be any more useful to train an additional layer to do the upsampling instead? I'm thinking of a layer analogous to the decoder layer in an autoencoder.
@KapilSachdeva
@KapilSachdeva Жыл бұрын
No need to be angry at it :) … yes you could do that. As a matter of fact the additional layers after upsampling is to reduce it effects. The cost would be number of parameters. So it is always a trade off.
@cheeziobodini
@cheeziobodini Жыл бұрын
@@KapilSachdeva Thank you! informative video btw
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@manueljohnson1354
@manueljohnson1354 4 ай бұрын
Excellent
@harshith_takkala
@harshith_takkala Жыл бұрын
thankyou sir !
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏
@lordfarquad-by1dq
@lordfarquad-by1dq Жыл бұрын
thank you for the content , next video soon?
@KapilSachdeva
@KapilSachdeva Жыл бұрын
🙏 … yes. Most likely tomorrow. Thanks for keeping me accountable.
@lordfarquad-by1dq
@lordfarquad-by1dq Жыл бұрын
@@KapilSachdeva thank you again for the content, looking forward for more of these videos
@KapilSachdeva
@KapilSachdeva Жыл бұрын
Still working on the next video; not yet happy with it hence not published yet.
@III.Jennifer
@III.Jennifer 27 күн бұрын
209 Lisandro Ridge
@farooqdsp
@farooqdsp Жыл бұрын
new video when ?
@KapilSachdeva
@KapilSachdeva Жыл бұрын
today ... very late sorry :(
@TeddyFlanagan-q8l
@TeddyFlanagan-q8l Күн бұрын
Clement Landing
@nayab.quteer
@nayab.quteer Жыл бұрын
Can you make the video in Urdu language
@KapilSachdeva
@KapilSachdeva Жыл бұрын
There are urdu subtitles and may be that will be of some help!
@DorisCorey-j7i
@DorisCorey-j7i 7 күн бұрын
Hernandez Betty Lewis Kenneth Gonzalez Christopher
@TameraSweet-n3t
@TameraSweet-n3t 8 күн бұрын
Haley Corner
@StudentsThough
@StudentsThough Ай бұрын
Garcia Larry Lewis Charles Hernandez Carol
@FinancialYaweli
@FinancialYaweli Ай бұрын
Walker William Moore Patricia Perez Anthony
@MichelleMoore-l2c
@MichelleMoore-l2c Ай бұрын
Pagac Road
@GoldYvonne-r9o
@GoldYvonne-r9o 28 күн бұрын
Hernandez Michael Taylor Donald Walker Richard
@SgheGejsj
@SgheGejsj Ай бұрын
Wilson Jose Lewis Matthew Smith Matthew
@DorisCorey-j7i
@DorisCorey-j7i 17 күн бұрын
Moore Kevin Moore Sharon Lewis Richard
@LoisStewart-t6g
@LoisStewart-t6g Ай бұрын
Thompson Cynthia Martin Frank Brown Jason
@EraRyba
@EraRyba 29 күн бұрын
8831 Osvaldo Heights
GIoU vs DIoU vs CIoU | Losses | Essentials of Object Detection
19:29
Kapil Sachdeva
Рет қаралды 4,6 М.
Detection Head | Essentials of Object Detection
15:32
Kapil Sachdeva
Рет қаралды 5 М.
小蚂蚁会选到什么呢!#火影忍者 #佐助 #家庭
00:47
火影忍者一家
Рет қаралды 91 МЛН
Don't look down on anyone#devil  #lilith  #funny  #shorts
00:12
Devil Lilith
Рет қаралды 45 МЛН
Feature Pyramid Network for object detection
22:38
Code With Aarohi
Рет қаралды 13 М.
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 342 М.
The Reparameterization Trick
17:35
ML & DL Explained
Рет қаралды 21 М.
Focal Loss for Dense Object Detection
12:57
ComputerVisionFoundation Videos
Рет қаралды 32 М.
The future of AI looks like THIS (& it can learn infinitely)
32:32
C4W3L08 Anchor Boxes
9:43
DeepLearningAI
Рет қаралды 138 М.
小蚂蚁会选到什么呢!#火影忍者 #佐助 #家庭
00:47
火影忍者一家
Рет қаралды 91 МЛН