Top Object Detection Models in 2023 | Model Selection Guide sponsored by Intel

  Рет қаралды 21,176

Roboflow

Roboflow

Күн бұрын

Description:
Discover the top object detection models in 2023 in this comprehensive video. We compare models like YOLOv8, YOLOv7, RTMDet, DETA, DINO, and GroundingDINO based on metrics like Mean Average Precision, community support, packaging, and licensing for you to decide which is best for your production AI applications. The video also details the challenges in comparing model speed and highlights important nuances within the realm of object detection, like choosing the right model for the right hardware and use case. It's an essential watch for anyone interested in computer vision and model selection. This research was sponsored by Intel.
#ObjectDetection #ComputerVision
Chapters:
- 00:00 Introduction
- 00:35 Object Detection
- 01:42 Mean Average Precision
- 02:28 Speed
- 03:40 Paper, Packaging, and License
- 04:35 YOLOv8
- 05:21 YOLOv7
- 06:06 YOLOv6-v3
- 07:01 RTMDet
- 07:46 RT-DETR
- 08:50 DETA
- 10:02 GroundingDINO
- 10:37 Model Community Comparison
- 11:46 Conclusion
Resources:
- 🌏 Roboflow: roboflow.com
- 📚 Roboflow Notebooks Repository: github.com/roboflow/notebooks
- 🌌 Roboflow Universe: universe.roboflow.com
- 📈GPU vs Intel HPU (new hardware options for AI): blog.roboflow.com/gpu-vs-hpu/
Stay updated with the projects I'm working on at github.com/roboflow and github.com/SkalskiP! ⭐
Remember to like, comment, and subscribe for more content on AI, computer vision, and the latest technological breakthroughs! 🚀

Пікірлер: 70
@hawkingradiation3774
@hawkingradiation3774 9 ай бұрын
would also like to see a video for comparison for segmentation tasks as well.
@Roboflow
@Roboflow 9 ай бұрын
Awesome idea! I'm curious if there are more people who would like to see that. It's a lot of work to create video like that.
@sozno4222
@sozno4222 9 ай бұрын
@@RoboflowI would also be interested in a video like that
@visuality2541
@visuality2541 9 ай бұрын
Indeed
@AlainPilon
@AlainPilon 9 ай бұрын
Thanks for the work! I am wondering if the COCO metric is actually useful. It is an interesting comparison point but I am not sure how it translates in practice given that most people only train detector for a very few set of classes. Community support, documentation and how the model integrates in the current ecosystem is much more impactful and I am glad you added these to your chart.
@Roboflow
@Roboflow 9 ай бұрын
I'm glad you agree with my methodology. I think the license is also very important. After all you must be able to use the model in your project. As for mAP, 100% agree. I'd love to have other metrics that I could use. We developed RF100 metric - paperswithcode.com/dataset/rf100, but I didn't have enough datapoints to compare all the models.
@ferneutron
@ferneutron 5 ай бұрын
Super great job, thanks!
@rperezalejo
@rperezalejo 9 ай бұрын
Great video, right now I am working on a real time sport object detection and this video comes like a charm for me. I can test other posibilities I did not had in mind.
@Roboflow
@Roboflow 9 ай бұрын
Awesome we came at the right one! I’d love to here about your results once you finish tests.
@rperezalejo
@rperezalejo 9 ай бұрын
​@@Roboflow I have to retrain on 4k images first and then see the performance of the models in real time, I am not sure If i am going to be able to test the all, but the idea of having the golden cub gives a sign on where to start. It is a great guide anyways
@juanolano2818
@juanolano2818 9 ай бұрын
Will go with Yolo8 for my current microbes identification project :) Thank you Piotr!
@Roboflow
@Roboflow 9 ай бұрын
If it is open source or academic good choice! 👍🏻
@saurabhgupta7148
@saurabhgupta7148 5 ай бұрын
I am also working with images of micro organisms. Did you get good results with YoloV8?
@satellite-image-deep-learning
@satellite-image-deep-learning 9 ай бұрын
Fantastic summary, thank you for the effort that went into this! MMdetection has come up before and I would love an intro video on it 😊
@Roboflow
@Roboflow 9 ай бұрын
We already have MMDetection. :) Take a look on our channel
@segheysens
@segheysens 9 ай бұрын
They already created a video here! 🙌 kzbin.info/www/bejne/a5zKiKyla7iaaqc
@mpty2022
@mpty2022 9 ай бұрын
thanks for sharing your research
@Roboflow
@Roboflow 9 ай бұрын
My pleasure!
@johannesmokami5760
@johannesmokami5760 9 ай бұрын
Thanks for the info I will definitely try them as well.
@Roboflow
@Roboflow 9 ай бұрын
Awesome! Which detector are you going to try?
@johannesmokami5760
@johannesmokami5760 9 ай бұрын
I'm currently trying out YOLOv8 but I'd like to try YOLOv7 and GroundingDINO @@Roboflow
@romroc627
@romroc627 9 ай бұрын
Excellent video, really useful.
@Roboflow
@Roboflow 9 ай бұрын
I’m so happy to see such a positive feedback!
@mr_tpk
@mr_tpk 7 ай бұрын
Thank you ❤
@Seethis-HD
@Seethis-HD 9 ай бұрын
Excellent video. Thanks for the efforts. I was wondering why you didn't consider YOLO-NAS in the list?
@Roboflow
@Roboflow 9 ай бұрын
I considered it but ultimately decided not to include it. I’m pretty confident those models are better choices.
@PhilippBlum
@PhilippBlum 4 ай бұрын
@ow What was the reason not to include it? Accuracy?
@EliSpizzichino
@EliSpizzichino 9 ай бұрын
We need a platform to fully compare them on real datasets on real training on the same device. Is also important to keep track if a version change produce worsening quality. I've noticed for example that between one minor version and another of the ultralitics codebase the quality of the final trained model worsened by a lot.
@Roboflow
@Roboflow 9 ай бұрын
That's super interesting! Could you share more insights on what the versions were? I'd love to do more investigation. As for the "platform to fully compare them on real datasets", have you seen RF100? paperswithcode.com/dataset/rf100
@EliSpizzichino
@EliSpizzichino 9 ай бұрын
@@Roboflow The last tested good version was 8.0.103. Unfortunately I had no time since then to do further investigation my self but I remember trying a couple of training out with some versions after that and got worse results I haven't tested the RF100 yet, it's a good effort and I like what you do as company (I never left such a good comment to anyone in my life :)
@user-fj1zz9mi5r
@user-fj1zz9mi5r 9 ай бұрын
I was looking for performance over the time inference for edge devices. I was trying to use Yolov8 for edge deployment into STM32 but at the end, i realized this model was too big for this card. What do you think is a good model for a good ratio between inference time / model size? Thanks for your response
@seanolivieri4829
@seanolivieri4829 8 ай бұрын
Do you have a video on licenses? I dont understand any of those and which one should I use if I want to be able to sell my program or call it my own
@rafael.gildin
@rafael.gildin 9 ай бұрын
Great Video 🎉
@Roboflow
@Roboflow 9 ай бұрын
My pleasure! 🔥
@tryingtobeproductive
@tryingtobeproductive 9 ай бұрын
This video is extremely useful 10/10
@Roboflow
@Roboflow 9 ай бұрын
Thank you! Awesome to hear such a positive feedback 🔥
@user-rx4wd3xn2c
@user-rx4wd3xn2c 7 ай бұрын
can we use yolov8 pretrained weights for commercial use?
@pleison111
@pleison111 4 ай бұрын
I have a question, I am working on a OCR project, I am using a fastrcnn with resnet50 as object detector, and then I need something like a conv + GRU or ViT to decode the text, do you have some suggestions regarding OCR?
@Roboflow
@Roboflow 4 ай бұрын
First of all why Fast RCNN? As for OCR did you try Tesseract?
@zskater1234
@zskater1234 9 ай бұрын
I’m currently porting GroundingDINO to the transformers library so buckle up
@Roboflow
@Roboflow 9 ай бұрын
Uuu! Awesome! I can’t wait to see that happening. Being able to setup GroundingDINO with single pip install.
@zskater1234
@zskater1234 9 ай бұрын
@@Roboflow the model is already ported, now I’m tackling the tests and documentation if everything goes well by next weekend I’ll finished it and will await the HF review
@shubh722
@shubh722 9 ай бұрын
I am doing an object detection task and get 97.4% accuracy on the dataset using yolov5 and will be running it on an edge device. Is yolov5 too old and Should I train a yolov8 model for faster inference? As I think accuracy will be almost similar as it’s already 97.4%. Or is it task specific. If yolov5 is performing good then is there any need to change. If anyone can suggest please
@Roboflow
@Roboflow 9 ай бұрын
I don’t think you can expect better accuracy than that. The main issue here is that YOLOv5 did not have proper Python packaging so integrating it into larger projects was problematic.
@omigator
@omigator 9 ай бұрын
We switched from Yv8 to Yv5 because it gave better performance without any loss in accuracy on our edge devices.
@shubh722
@shubh722 9 ай бұрын
@omigator What do you think about the inference times between v8 and v5? Is it real time? Also idk I found yolov5 easier to use as well. I was training it in Azure ml so was much easier to tweak the files for v5 to train there rather than v8. And the accuracy is pretty good as well.
@satyajitpanigrahy7742
@satyajitpanigrahy7742 9 ай бұрын
Kindly, Update the ultralytics package for YOLOv4 model
@Roboflow
@Roboflow 9 ай бұрын
Hi @satyajitpanigrahy7742 👋 Ultralytics is a separate team. Kindly, try to submit bug report in their repository: github.com/ultralytics/ultralytics
@tyronetyrone2652
@tyronetyrone2652 9 ай бұрын
Which framework is better to use in embedded chips?
@Roboflow
@Roboflow 9 ай бұрын
which board are you using?
@sarathkumar-gq8be
@sarathkumar-gq8be 5 ай бұрын
Which model will perform better in raspberry pi
@appliedml8665
@appliedml8665 9 ай бұрын
Yolo gold is now available.
@Roboflow
@Roboflow 9 ай бұрын
Yeah I know... This video was recorded before YOLO GOLD was released. I didn't have time to play with it yet. Have you?
@8eck
@8eck 9 ай бұрын
Where is DETA video? Couldn't find DETA with 100k stars... Could you please add github link here.
@Roboflow
@Roboflow 9 ай бұрын
For now we only have DETR. You can find it here: kzbin.info/www/bejne/d36bdWegbrCil7c As for star count, DETA is distributed via transformers library and that's what I used to measure community size.
@titusfx
@titusfx 9 ай бұрын
In 2:28 why not just do asymptotic analysis (computational complexity analysis).
@Roboflow
@Roboflow 9 ай бұрын
Hi 👋🏻! You mean use FLOPS to asses complexity?
@sanchaythalnerkar9736
@sanchaythalnerkar9736 9 ай бұрын
Can you show actual code and real time comparison of these?
@Roboflow
@Roboflow 9 ай бұрын
You mean independent time benchmark comparing the speed of all of those models?
@sanchaythalnerkar9736
@sanchaythalnerkar9736 9 ай бұрын
Yes Exactly , a side by side comparison@@Roboflow
@Roboflow
@Roboflow 9 ай бұрын
@@sanchaythalnerkar9736 not sure if it really can be side by side. To truly measure the model speed we need to make sure there is no other heavy process running on the machine. But sure we can try to make that happen. I'll add it to our TODO list.
@8eck
@8eck 9 ай бұрын
Found only DETA with 198 stars, not 100k like in your table...
@Roboflow
@Roboflow 9 ай бұрын
I responded to that question under your other comment :)
@8eck
@8eck 9 ай бұрын
RT-DETR have 355 stars, 20k+
@Roboflow
@Roboflow 9 ай бұрын
To be honest no one use implementation from original repository. RT-DETR is distributed via PaddlePaddle package. That's why we use 20k+ star count. I know it is not perfect... but like I said I decided to use the top repo that make the model accessible.
@8eck
@8eck 9 ай бұрын
​@@Roboflowcan you please drop some links? Thank you.
@Roboflow
@Roboflow 9 ай бұрын
@@8eck take a look here: github.com/PaddlePaddle/PaddleDetection/tree/develop/configs/rtdetr and here: huggingface.co/docs/transformers/main/en/model_doc/deta
@adurks4846
@adurks4846 9 ай бұрын
Personally, I've found yolov8 to be disappointing in the real world. I work in aerial/satellite imaging and yolov8 performs ~10% worse than scaled-yolov4. Most of the others on that list perform similarly. Overall, it seems like once you leave the types of images/targets in the COCO dataset, the metrics mean less and less for what will do well on your project.
@Roboflow
@Roboflow 9 ай бұрын
Absolutely agree! I even said that in the video. I'd love to have other metric to compare models, not just mAP on COCO. The moment to start to fine-tune the model on your dataset that number means nothing. Do you care about the speed when you process aerial/satellite imaging?
@adurks4846
@adurks4846 9 ай бұрын
@@Roboflow We don't care that much about speed. However, we don't typically have much data which means that the larger models seem to do worse. Do you guys have in-house metrics for some of these models using the roboflow-100?
A clash of kindness and indifference #shorts
00:17
Fabiosa Best Lifehacks
Рет қаралды 97 МЛН
Survival skills: A great idea with duct tape #survival #lifehacks #camping
00:27
How to Choose the Best Computer Vision Model for Your Project
12:59
How AI 'Understands' Images (CLIP) - Computerphile
18:05
Computerphile
Рет қаралды 186 М.
Fast Segment Anything (FastSAM) vs SAM | Is it 50x faster?
16:02
YOLOv10: Train a Custom Model and Run Inference on Live Webcam
24:37
Nicolai Nielsen
Рет қаралды 17 М.
How YOLO Object Detection Works
17:04
DeepBean
Рет қаралды 27 М.
Scientific Concepts You're Taught in School Which are Actually Wrong
14:36
Fast Zero Shot Object Detection with OpenAI CLIP
29:32
James Briggs
Рет қаралды 10 М.
EXEED VX 2024: Не өзгерді?
9:06
Oljas Oqas
Рет қаралды 44 М.
OZON РАЗБИЛИ 3 КОМПЬЮТЕРА
0:57
Кинг Комп Shorts
Рет қаралды 1,8 МЛН
Klavye İle Trafik Işığını Yönetmek #shorts
0:18
Osman Kabadayı
Рет қаралды 5 МЛН