Computer Vision on NPU - all you need to know

Views: 2,640

Anton Maltsev

1 day ago

Comments: 14
@wolpumba4099 5 months ago
*Summary: Running Computer Vision Models on NPUs*

What is an NPU? (0:37)
- NPUs are specialized silicon chips optimized for running neural network computations, especially matrix multiplications.
- Unlike CPUs and GPUs, they can't run general-purpose programs, focusing purely on neural network inference.
- Many different names exist for these chips, including LPU, TPU, VPU, etc., but they share the core idea of accelerating neural network calculations.

Why Use NPUs? (2:29)
- Main advantages: reduced power consumption, lower device cost, potential for significant speedups compared to CPU/GPU for specific tasks.
- Main disadvantages: increased development complexity, limited choice of neural network architectures, more intricate deployment and testing processes.

Challenges of working with NPUs:
- Diverse Ecosystem: (7:42) A vast landscape of vendors, frameworks, and boards makes finding a perfect solution difficult. Each vendor typically offers its own custom framework.
- Model Export and Compatibility: (10:09)
  - Requires careful preparation, including specific patches and quantization, to adapt your model to the target NPU architecture.
  - Non-maximum suppression (NMS) (18:59) often needs to be handled outside the NPU, requiring separate code or fallback mechanisms.
- Memory Limitations: (20:54)
  - Limited memory size on NPUs restricts model size and complexity.
  - Memory access speed and structure significantly impact performance.
- Preprocessing: (22:46) May need to be performed separately on the CPU, GPU, or a dedicated accelerator, depending on the NPU and its capabilities.
- Transformer Support: (23:58) Limited or non-existent on many NPUs, often requiring model adjustments or alternative convolutional architectures.
- Layer Support: (25:23)
  - Advertised layer support can be misleading due to merged layers or limited functionalities.
  - Always verify compatibility and performance for your specific model layers.
- Quantization: (27:33)
  - Essential for many NPUs to reduce model size and accelerate inference.
  - Can be complex and lead to accuracy degradation, requiring careful fine-tuning and evaluation.
- Benchmarks: (30:30)
  - Often don't reflect real-world performance.
  - Always test on your target hardware and specific model for accurate results.

Additional considerations:
- CPUs play a vital role in data transfer, image decoding, preprocessing, and fallback mechanisms, impacting overall performance (36:43).
- C++ is the dominant language for inference on most NPUs, while Python prevails in model training and export (38:45).
- Training on NPUs is possible but involves a separate class of processors and different considerations (39:51).

I used Gemini 1.5 Pro.
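To make the NMS point from the summary concrete: when the NPU only returns raw boxes and scores, the suppression step has to run on the host CPU. The following is a minimal sketch of plain greedy NMS in pure Python, not the code from the video or from any vendor SDK. The function names `iou` and `nms`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are all assumptions chosen for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: return indices of the kept boxes, highest score first."""
    # Visit candidates from the most to the least confident.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Hypothetical raw detections as they might come back from an NPU.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores))  # -> [0, 2]
```

In a real deployment this step would more likely be written in C++ or taken from an existing library, since the summary notes that CPU-side work affects overall performance.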
@zorqis 1 month ago
Good summary, and useful for passers-by. However, the video contains some brief remarks that carry a lot of useful information, so I still recommend watching the whole video.
@shakhizatnurgaliyev9355 5 months ago
good one!
@andreyl2705 5 months ago
awesome)
@diegosantos9757 5 months ago
Dear, tks for the content. Which SBC would you recommend for someone just starting with computer vision?
@AntonMaltsev 5 months ago
Depends on your budget. The smoothest experience is with Jetsons or Intel-based boards. On a low budget, I recommend some Rockchip-based solutions.
@diegosantos9757 5 months ago
Tks mate, I will check out the Rockchip!
@ДенисСлепцов-ь6п 5 months ago
Hello, I've been following your work for a long time. Please keep it up! It's very interesting. Could you tell me whether you have ever deployed a neural network on an FPGA? If so, could you please share your experience?
@AntonMaltsev 5 months ago
Hello, thank you! A couple of times I wanted to test the Xilinx Kria, but each time I was talked out of it and told it was complete junk. In general, a default FPGA doesn't map all that well onto neural network architectures, so it's not even clear what the point would be...
@ДенисСлепцов-ь6п 5 months ago
@AntonMaltsev Got it, thank you
@עינהרע 5 months ago
Are you gonna test the new Hailo GenAI M.2 board?
@AntonMaltsev 5 months ago
It's difficult to buy a single unit for home use, and none of my friends or colleagues are using it right now, so I have no chance to borrow one. So, it's not in the plans. But if there is a chance, I will try.
@AntonMaltsev 5 months ago
But the next video will probably be about my experience of using Hailo in production (more about the framework and Hailo-8).
@____________________________.x 5 months ago
Your jump cuts make this confusing