Detection Head | Essentials of Object Detection

5,701 views

Kapil Sachdeva

A day ago

Comments: 22

@frazuppi4897 A year ago

this channel is amazing!!!!

@KapilSachdeva A year ago

🙏 Not sure if there is anything for you to learn from my channel, but I sincerely appreciate your kind words.

@kshamanthkumar6042 A year ago

Awesome 🤩, thank you so much sir.

@KapilSachdeva A year ago

🙏

@deeptirawat6091 4 months ago

At 11:02, instead of "the first rows of the first four channels will be for box coordinates", did you intend to say "the first cells of the first four channels will be for box coordinates"?

@harshith_takkala A year ago

Clean explanation!

@KapilSachdeva A year ago

🙏

@AdityaPrakash-nk9gc 5 months ago

At 5:01, could you please explain why it is [1,5] and not [5,1]? Shouldn't the coordinates be in (x,y) format?

@KapilSachdeva 5 months ago

No, the coordinates are in [y,x] order. There is nothing special about it; it is just a convention used in all object detection models.

@husseinjlailaty5852 A year ago

Very nice lecture, sir. Thank you! Isn't the cell position [1][5] the 19th cell (not the 15th cell)?

@KapilSachdeva A year ago

Yes, it will be the 19th cell. Did I say 15th cell in the tutorial?

@husseinjlailaty5852 A year ago

@@KapilSachdeva Yes, no worries. Please keep doing your magnificent work.
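
The cell arithmetic in this exchange can be checked with a short sketch. It assumes a row-major flattening of the grid, the [y,x] (row, column) convention mentioned in this thread, and a grid that is 13 cells wide. The width is an assumption chosen because it is the value that makes [1][5] land on the 19th cell; the function name `flat_cell_number` is hypothetical.

```python
def flat_cell_number(row, col, grid_width):
    """Return the 1-based position of cell [row][col] in a row-major flattened grid."""
    # Each full row above contributes grid_width cells; col is the zero-based
    # offset inside the current row; +1 converts to a 1-based count.
    return row * grid_width + col + 1


GRID_WIDTH = 13  # assumption: a 13-column grid, consistent with "19th cell"

# Indexing is [y, x], i.e. [row, column], so [1][5] is row 1, column 5.
cell = flat_cell_number(1, 5, GRID_WIDTH)
print(f"Cell [1][5] is number {cell} in the flattened grid")
```

With a different grid width the same index maps to a different position, so the 19 here holds only under the 13-column assumption.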

@КириллКлимушин A year ago

Thank you❤

@KapilSachdeva A year ago

🙏

@gqgqrghqrhgq A year ago

Hi! Thank you for the great tutorial. I understand why we use the detection head and how it works. But I don't get how we would combine the 3 outputs of the 3 heads. How would we know which output/head (high-level, mid-level, low-level) is responsible for which ground truth box, so that we can calculate the loss? Or is there a way to combine the outputs of the 3 heads into a single one? Thank you

@KapilSachdeva A year ago

I am assuming you are familiar with the notion of anchor boxes. The anchor boxes are assigned to different levels, and during training you associate each ground truth box with an anchor box. This is how a particular level becomes responsible for predicting that ground truth box.

@gqgqrghqrhgq A year ago

@@KapilSachdeva Thank you for the response and your awesome videos! I think I get it now. I never found a good explanation for it. Do we consequently use bigger anchor boxes for the higher levels and smaller ones for the lower levels? And do we therefore know which ground truth box to assign to which level using the IoU score?

@KapilSachdeva A year ago

Yes
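
The assignment discussed in this thread can be illustrated with a minimal sketch. It is not the method of any particular detector: the anchor sizes, the level names, and the helper names `iou_wh` and `assign_level` are all hypothetical, and the IoU is computed on width and height only (as if the boxes shared a center), which is a simplification. Real detectors typically also apply IoU thresholds and match anchors at specific grid positions.

```python
def iou_wh(wh_a, wh_b):
    """IoU of two boxes that share the same center, given as (width, height)."""
    inter = min(wh_a[0], wh_b[0]) * min(wh_a[1], wh_b[1])
    union = wh_a[0] * wh_a[1] + wh_b[0] * wh_b[1] - inter
    return inter / union


# Hypothetical anchors in pixels: smaller boxes on the lower level,
# bigger boxes on the higher level, as described in the thread.
ANCHORS = {
    "low_level": [(16, 16), (32, 16), (16, 32)],
    "mid_level": [(64, 64), (128, 64), (64, 128)],
    "high_level": [(256, 256), (512, 256), (256, 512)],
}


def assign_level(gt_wh):
    """Return (level, anchor, iou) for the anchor that best overlaps the ground truth box."""
    candidates = [
        (level, anchor, iou_wh(gt_wh, anchor))
        for level, anchors in ANCHORS.items()
        for anchor in anchors
    ]
    # The level owning the best-matching anchor becomes responsible for this box.
    return max(candidates, key=lambda c: c[2])


for gt in [(20, 18), (70, 60), (300, 280)]:
    level, anchor, score = assign_level(gt)
    print(f"ground truth {gt} -> {level}, anchor {anchor}, IoU {score:.2f}")
```

Under these assumed anchors, a small ground truth box is matched to the low level and a large one to the high level, so the loss for each box is computed only against the output of the head at its assigned level.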