Detection Head | Essentials of Object Detection

  5,701 views

Kapil Sachdeva

Days ago

Comments: 22
@frazuppi4897 1 year ago
this channel is amazing!!!!
@KapilSachdeva 1 year ago
🙏 not sure if there is anything for you to learn from my channel but sincerely appreciate your kind words.
@kshamanthkumar6042 1 year ago
Awesome 🤩, thank you so much sir.
@KapilSachdeva 1 year ago
🙏
@deeptirawat6091 4 months ago
At 11:02, instead of "the first rows of the first four channels will be for box coordinates", do you intend to say "the first cells of the first four channels will be for box coordinates"?
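The distinction in this question (cells versus rows) is easier to see with a concrete tensor. The sketch below is only an illustration under stated assumptions: a channels-first head output indexed as output[channel][y][x], one prediction per cell, and 4 box channels followed by class-score channels. The names NUM_CLASSES and split_cell are hypothetical and the layout in the video may differ.

```python
# Hypothetical head output, indexed as output[channel][y][x].
# Assumption: the first 4 channels hold box coordinates and the remaining
# channels hold class scores, with one prediction per grid cell.
NUM_CLASSES = 3
CHANNELS = 4 + NUM_CLASSES
HEIGHT, WIDTH = 2, 2

# Fill each entry with channel*100 + y*10 + x so its origin is easy to read.
output = [
    [[c * 100 + y * 10 + x for x in range(WIDTH)] for y in range(HEIGHT)]
    for c in range(CHANNELS)
]


def split_cell(output, y, x, num_box_values=4):
    """Collect the values of one cell across all channels.

    Returns (box_values, class_scores) for the cell at row y, column x.
    """
    values = [channel[y][x] for channel in output]
    return values[:num_box_values], values[num_box_values:]


box, scores = split_cell(output, 0, 0)
print(box)     # values taken from channels 0..3 at cell (0, 0)
print(scores)  # values taken from the remaining channels at cell (0, 0)
```

Under this layout, the box for a given cell is read from the same (y, x) position in each of the first four channels, not from a whole row.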
@harshith_takkala 1 year ago
Clean explanation!
@KapilSachdeva 1 year ago
🙏
@AdityaPrakash-nk9gc 5 months ago
At 5:01, could you please explain why it is [1,5] and not [5,1]? Shouldn't the coordinates be in (x,y) format?
@KapilSachdeva 5 months ago
No, the coordinates are in [y,x] order (row first, then column). There is nothing special about it as such; it is just a convention widely used in object detection models.
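The [y,x] convention matches how a grid stored as a list of rows is indexed: the first index selects the row (vertical position) and the second selects the column (horizontal position). A minimal sketch, assuming a hypothetical 3-row by 7-column grid; the grid size and the cell_at helper are illustrative only and are not taken from the video.

```python
# A small grid stored as a list of rows; each cell is labelled with its
# row and column so the lookup result is self-explanatory.
ROWS, COLS = 3, 7  # hypothetical grid size
grid = [[f"r{r}c{c}" for c in range(COLS)] for r in range(ROWS)]


def cell_at(grid, y, x):
    """Return the cell at row y (vertical) and column x (horizontal)."""
    return grid[y][x]


picked = cell_at(grid, 1, 5)
print(picked)  # the cell in row 1, column 5
```

Swapping the order to [5,1] would ask for row 5, which does not exist in a 3-row grid, so the two orders are not interchangeable.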
@husseinjlailaty5852 1 year ago
Very nice lecture, sir. Thank you! Isn't the cell position [1][5] the 19th cell (not the 15th cell)?
@KapilSachdeva 1 year ago
Yes, it will be the 19th cell. Did I say 15th cell in the tutorial?
@husseinjlailaty5852 1 year ago
@KapilSachdeva Yes, no worries. Please keep on doing your magnificent work.
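The cell count for a [row][col] position can be checked with row-major arithmetic. The thread does not state the grid width, so the sketch below assumes a width of 13 columns, zero-based [row][col] indices, and counting cells from 1; under those assumptions position [1][5] is indeed the 19th cell. The function name flat_cell_number is hypothetical.

```python
def flat_cell_number(row, col, grid_width, one_based=True):
    """Map a zero-based [row][col] position to a row-major cell count."""
    index = row * grid_width + col  # zero-based running index
    return index + 1 if one_based else index


GRID_WIDTH = 13  # assumption: not stated in the thread

cell_number = flat_cell_number(1, 5, GRID_WIDTH)
print(cell_number)
```

One full row of 13 cells comes first, then 5 more cells are skipped in the second row, so the target is the 13 + 5 + 1 = 19th cell.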
@КириллКлимушин 1 year ago
Thank you❤
@KapilSachdeva 1 year ago
🙏
@gqgqrghqrhgq 1 year ago
Hi! Thank you for the great tutorial. I understand why we use the detection head and how it works, but I don't get how we would combine the outputs of the 3 heads. How would we know which output/head (high-level, mid-level, low-level) is responsible for which ground truth box, so that we can calculate the loss? Or is there a way to combine the outputs of the 3 heads into a single one? Thank you.
@KapilSachdeva 1 year ago
I assume you are familiar with the notion of anchor boxes. The anchor boxes are assigned to different levels, and during training you associate each ground truth box with an anchor box. This is how a particular level becomes responsible for predicting that ground truth box.
@gqgqrghqrhgq 1 year ago
@KapilSachdeva Thank you for the response and your awesome videos! I think I get it now; I never found a good explanation for it. Do we consequently use bigger anchor boxes for the higher levels and smaller ones for the lower levels? And do we therefore know which ground truth box to assign to which level using the IoU score?
@KapilSachdeva 1 year ago
Yes
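The assignment described in this exchange can be sketched as follows. This is a simplified illustration, not the procedure of any specific detector: it assumes one square anchor per level with hypothetical sizes of 32, 64 and 128 pixels, and it centers each anchor on the ground truth box, whereas real detectors place anchors at fixed grid positions. Boxes use the (y_min, x_min, y_max, x_max) order.

```python
def iou(a, b):
    """Intersection over union of two (y_min, x_min, y_max, x_max) boxes."""
    inter_h = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_w = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_h * inter_w
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical anchor side lengths: small anchors on the low level,
# large anchors on the high level.
ANCHOR_SIZES = {"low": 32, "mid": 64, "high": 128}


def centered_anchor(cy, cx, size):
    """Square anchor of the given side length centered at (cy, cx)."""
    half = size / 2
    return (cy - half, cx - half, cy + half, cx + half)


def assign_level(gt_box):
    """Pick the level whose anchor overlaps the ground truth box the most."""
    cy = (gt_box[0] + gt_box[2]) / 2
    cx = (gt_box[1] + gt_box[3]) / 2
    return max(
        ANCHOR_SIZES,
        key=lambda level: iou(gt_box, centered_anchor(cy, cx, ANCHOR_SIZES[level])),
    )


for gt in [(0, 0, 30, 30), (0, 0, 60, 60), (0, 0, 120, 120)]:
    print(gt, "->", assign_level(gt))
```

With these assumed sizes, a small ground truth box matches the low-level anchor best and a large one matches the high-level anchor, so the loss for each box is computed against the output of the head at that level.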
@shabbirahammed4596 1 year ago
nice...
@KapilSachdeva 1 year ago
🙏
@chaouidhuzgen6818 1 year ago
Hi, amazing explanations, bravo! How can I contact you, sir?
@KapilSachdeva 1 year ago
🙏 If you have questions, you can always ask them in the comments.