Nice! Could you please make a video on detecting objects in a video using YOLO-World, with bounding boxes and confidence scores? Thanks.
@nitishreddynandyala2609 10 months ago
Very useful, thank you.
@fintech1378 10 months ago
Is it a vision-language model like CLIP?
@next_phase 10 months ago
Yes, it is similar to that, but it also provides bounding boxes.
@informative7410 10 months ago
How can I convert it to TFLite?
@next_phase 10 months ago
I could not figure out a way to do that. I tried using Ultralytics, but it did not work out. You can check my code here: colab.research.google.com/drive/1cxqMiSuNPSwy4z8nhgxIVK8YQxINbak_?usp=sharing
@lxt-j1q 8 months ago
The accuracy of YOLO-World is too low.
@next_phase 8 months ago
I agree. However, don’t forget that this is the first version of YOLO with such an ability. The next versions will be better.