Anton Maltsev

5:18

All you need to know about AI limitations (2025)

19 hours ago

18:54

Usage of Texas Instruments (TI) NPU for Computer Vision

1 day ago

3:28

The Dark Side of AI. Stable Point Aware 3D example (SPAR3D)

14 days ago

48:52

Results of 2024 in CV and ML. What happened, what we expect.

28 days ago

15:02

How to run neural networks on RockChip in 2025 (rknn-toolkit, rknn model zoo, rkllm).

1 month ago

8:22

Is the Radxa Rock5C Lite (RK3582) one of the best boards for hobby Computer Vision Right now?

1 month ago

47:55

Choosing a 2d camera for Computer Vision product: The Comprehensive Guide

1 month ago

12:11

Using big pre-trained models for prototyping

2 months ago

4:16

A short sample: Classic stereo Depth vs. Neural stereo Depth vs. Monocular depth.

2 months ago

12:15

How fast is Hailo-8L with boards other than RPi5?

2 months ago

17:21

Depth estimation. From the theory to the Edge.

3 months ago

17:50

Orbbec Gemini 335L. Let's check how it's working.

3 months ago

9:01

Depth Pro - monocular network from Apple. But can it do this?!

3 months ago

53:32

Choosing AI Edge board in 2024 / 2025

3 months ago

14:09

Ultralytics Yolo (Yolov11). Do you need it?

3 months ago

28:31

Does it make sense to go into Computer Vision and ML in 2024?

4 months ago

18:12

Does this even work?! - Radxa NIO 12L (MediaTek Genio 1200)

4 months ago

23:05

Computer Vision and AI for NXP (Debix model A)

5 months ago

13:35

LLMs for RockChip. Guide for RKLLM. RK3588 vs RK3576 comparison

5 months ago

4:01

Data or Models? A lot of AI researchers struggle with this!

5 months ago

44:35

Everything about OpenCV, OpenCV.ai in 2024

5 months ago

9:30

Segment Anything 2 (SAM 2): how to start + limitations

5 months ago

54:08

Albumentations with Vladimir: everything about augmentations

6 months ago

8:09

The five main reasons your Computer Vision system will not work

6 months ago

25:17

OrangePi AiPro. A comprehensive guide: setting up, model export, and overview (Huawei Ascend 310 B4)

6 months ago

18:44

Do you need a new ML model in production?

7 months ago

28:27

Data Science. How a beginner differs from an expert.

7 months ago

12:19

MASA tracker. Is it hype or a pretty nice tracker?

7 months ago

12:22

MAX78000FTHR - how good is it today?

7 months ago

Comments

@MihiroParmar 13 hours ago

Hey, thanks for all these videos. Could you please make a video about MIPI CSI virtual channels and how to use them, explaining everything required?

@Andreas_Hopf 17 hours ago

What would be relevant for creators is if there were a way to feed in, say, six Midjourney images of the same object, showing it from four sides plus top and bottom, and then have a quality 3D mesh generated from that. So far, Midjourney and others fail to provide six views of the same object. Any ideas?

@antonrrtyq 18 hours ago

But PaliGemma can do detection.

@r_ruslan 18 hours ago

@AntonMaltsev I have an RK3576 with 2 NPU cores and 8 CPU cores, and my camera runs at 30 FPS. I used that one, but performance still did not improve. Is it possible to combine CPU + NPU? How?

@kuliev.vitaly 21 hours ago

InternVL2 already had detection functionality.

@ARMENIA181 1 day ago

Thank you for the video. Could you please make a video on how to run Qwen2.5-VL on Rockchip?

@AntonMaltsev 1 day ago

It's a little bit complicated right now (Qwen requires the latest drivers, which are still not available for the main fork of the system). It's possible to do, but for now I need my board with a different system (only one of my boards is capable of running Qwen). So I will definitely try at some point, but not now.

@goldennotchcreeper4838 1 day ago

Very good video! Been watching you for a while now :)

@meirakrutkrut4143 3 days ago

Hello, sorry that this is off-topic for the video; I only recently came across your channel. I have started getting interested in computer vision, and I like this field. However, I noticed that it might be better to enter it from an adjacent specialty, say, as a regular software developer. I'd like to know the best way to enter this field smoothly, without ending up jobless. I just thought that the entry threshold here may be high. What can you advise me on this?

@PankajDoharey 7 days ago

LLM based AI is far from diagnosing my water filter issues. I need a water filter repairman for that.

@vitaliy_dushepa 7 days ago

The LiveBench AI benchmark is quite good.

@vitalyl1327 7 days ago

AGI is already here. I'm not an influencer. I'm an engineer, working on agentic systems built on top of small (below 14B) LLMs. My systems are capable of general problem solving - simply by offloading all the reasoning to external tools (such as Prolog and SMT solvers). I do not believe we need LLMs to be able to reason on their own, we already have a much better and more precise reasoning engine.

@tempacc9589 7 days ago

😂😂😂😂

@vitalyl1327 7 days ago

@tempacc9589 what's so funny? Did you try to offload reasoning to external logical inference? It works way better than human reasoning, scales to any size of the problem.

@jiayojames 7 days ago

The issue is that "AGI" is such a badly defined concept. There's going to be people who refuse to acknowledge that something is an AGI unless it has a mind exactly like a human... which of course would be a totally illogical expectation for how AI will progress. Ultimately the only real way to measure it is in the accuracy, competence, and flexibility of task completion, regardless of whatever reductionist arguments the critics will level in regards to how it gets there.

@vitalyl1327 7 days ago

@ it does not really matter what people think and how they define things. What matters is the tasks this system can perform autonomously. And if these tasks overlap with everything humans do and it can replace humans and do their work better and faster - it's an AGI. I'm only interested in replacing engineers, so I don't really care about proving anything beyond this narrow niche.

@s1v7 8 days ago

Dude, think through what you want to say in advance; your train of thought jumps back and forth, and it's hard to understand what you are actually trying to say.

@fontenbleau 8 days ago

Yes, the solution is leaking into datasets, but DeepSeek V2.5 was the only one in my local offline test that repaired very unusual code; all the others, Qwen or Mistral Large, couldn't fix that logical problem. I'm now testing V3 locally (their R1 is next), and it's quite good; the only thing is that I need to go up from Q5 to Q6, because quality suffers at low quants. But even at Q5 it's trying hard to write me a Bach symphony in a sound programming language. I've noticed that it has tuned itself away from dumping all the code in every answer, which is a sign of dumb models like Qwen or Mistral; now it writes only the parts to edit or replace, which saves tokens.

@AntonMaltsev 8 days ago

Definitely, DeepSeek looks nice. But I am sure they used a lot of GPT generation for it, which is not a completely good solution.

@andreyl2705 8 days ago

Anton, what do you think about the AI camera "reCamera 2002" from Seeed Studio?

@AntonMaltsev 8 days ago

It's the same SG2002 processor that Milk-V has, but the software implementation is much better. I expect something like this - kzbin.info/www/bejne/noTZZWxui92Cqqc But I have not tried it myself.

@thet0ast3r 9 days ago

how about performance?

@tot_ra 9 days ago

It is pronounced "Teksas", not "Tekhas" )

@AntonMaltsev 9 days ago

😁 Yes, I slip up sometimes)

@thinkIndependent2024 11 days ago

Did you use the AI Pro 20 TOPS with 24 GB, or the 8 TOPS with 8 GB?

@AntonMaltsev 11 days ago

20TOPS version

@thinkIndependent2024 11 days ago

@AntonMaltsev interesting, no other SBC has 24 GB with 20 TOPS, and it performed below a Pi 5 with 8 GB and no NPU acceleration?

@than1794 12 days ago

Good day. What is the inference time for YOLO on this board? If it is mentioned in the video, it is hard to hear.

@aiden_3c 13 days ago

It might be worth looking into this board again! The documentation is a lot better, but still isn't the best... Community documentation being split among two websites is the biggest thing for me. I've been able to get some amazing stuff working, like running small LLMs, Arm and RiscV concurrency, etc. Cross compiling is also a lot easier, still needs a little setup but now I can just run a custom GCC command and use it just like normal GCC. Both for building custom images, and for building code to run on the RiscV processor. Planning on getting Yolo running on here too for some drone footage processing, found this video while trying to find _anyone_ who's posted a video of its performance

@AntonMaltsev 13 days ago

Nice idea. Thanks for catching this. Definitely not now, but it may be a good idea to take a second Sophon-based board and check the overall progress of the whole ecosystem.

@elenikatsioli5360 14 days ago

AI is applied effectively in natural language, performs even better in coding, and excels further in games. A finite set of human rules enables refined statistics and probabilistic methods to function with reasonable accuracy. Reality is philosophically deterministic in terms of causality, but not in a strictly mathematical sense. The challenge for AIs in dealing with reality lies in its non-deterministic nature: a given state does not always lead to the same outcome. There is a lack of abstraction and symbolism, persistent issues in current models that cannot be solved merely through scaling or refinement.

@saltyboiproductions 15 days ago

Good video, hard to follow your speaking pattern. Feel free to shoot multiple takes man. Especially for a 3 minute video. Recommend getting a better microphone for these things or moving closer to your microphone

@QW3RTYUU 14 days ago

It’s a common accent online. Not everyone is US-born-and-raised.

@saltyboiproductions 14 days ago

@QW3RTYUU I said pattern of speaking, not their accent. Probably preventable with a script and multiple takes, as I also mentioned. Bold of you to assume I'm US born and raised lol.

@ContentSafe 15 days ago

One of the few AI-related videos that is not overly dramatic or critical, just showing the current limits. I like it.

@randomchannel-px6ho 15 days ago

The real ones know computer scientists promised "AGI any day now" in 1956... 1956! This isn't a dystopia...

@usamazaheer3109 16 days ago

Zed2i is the best

@fontenbleau 16 days ago

For now I'm interested in video generation models. Text-to-video is always random, but image-to-video is more interesting. LTXVideo is almost there, but finding the perfect settings takes hours (in ComfyUI). Ruyi kind of works, but only for short 5-second GIFs, and they can't make it faster (40 minutes to generate on a 4070 Ti Super). Hunyuan is praised as the best Chinese video generator, but I haven't tried it with images, and it's the most demanding in processing power and VRAM.

@afarsek_91 16 days ago

Maybe it makes sense to have two separate channels, one in Russian and the other in English? 😶

@AntonMaltsev 16 days ago

This one is in English ;)

@AntonMaltsev 16 days ago

In Russian there is, for example, this one - www.youtube.com/@ZlodeiBaal

@AmiprojectDotCom 18 days ago

Thank you! Looking forward to new episodes!

@dnogin 20 days ago

Awesome, thanks! I listened from start to finish with interest!

@romarsit1795 23 days ago

AI is overrated

@AntonMaltsev 23 days ago

NO WAY!!!11!

@haithemsekri6612 26 days ago

Thank you man for the great tuto 🙏

@ShopperPlug 27 days ago

This is true, everyone is stating that YOLO11 isn't a big deal. I'm running many tests and will conclude how much better it is. I have an RTX 3080 Ti. There is only one reason why I'm working with YOLO11: it has extremely good documentation on how to get things running.

@Alienvlg 28 days ago

Fire!

@КириллЧе-я5ы 28 days ago

I wonder, is signal processing that extracts patterns and features probably also an LVM domain? And is it worth applying existing large models to it, say, by putting them into a pipeline wrapper like LangChain? Or is it maybe better to try building my own, if datasets are available?

@AntonMaltsev 28 days ago

Well, if there is a large dataset and the feature is deterministic, it is better to train something. LangChain and the like are more about decision-making logic, when you need to tie a lot of things to a knowledge space.

@КириллЧе-я5ы 28 days ago

@ thank you!

@КириллЧе-я5ы 28 days ago

Folks, Happy New Year!! Are you in Belgrade, judging by the rooftops?.. ☺️

@AntonMaltsev 28 days ago

No, this is my kitchen, just the view in a different direction than usual. Still in Norway.

@СергейОсташко-у7б 29 days ago

You speak Russian well. As if it were native ;))

@positivenozy6065 28 days ago

And you "speak" it poorly. What exactly is "native" to you?)

@youknowwhatlol6628 16 days ago

@positivenozy6065 this language is not his native one, that's obvious.

@sashulya-su 1 month ago

Very interesting!

@PRiKoL1ST1 1 month ago

It really is better with the microphone.

@fontenbleau 1 month ago

What do you think about the Chinese Genesis project?

@AntonMaltsev 1 month ago

This one - github.com/Genesis-Embodied-AI/Genesis ? I'm skeptical of any kind of generation if the data can be obtained more easily. Usually, if you need to generate, it triples the project's complexity.

@fontenbleau 1 month ago

@AntonMaltsev thank you ✍️🤝

@teodorchaly184 1 month ago

Interesting video! I watch almost every video on your channel. I wanted to ask for some advice: I’m planning to create a KZbin channel and wanted to know how to create timestamps for videos. Is it done manually or is there a way to automate the process?

@AntonMaltsev 1 month ago

Hi, Teodor! I am doing this myself, but I think Gemini can easily do this - cloud.google.com/vertex-ai/generative-ai/docs/samples/generativeaionvertexai-gemini-video-with-audio

@teodorchaly184 1 month ago

@AntonMaltsev Oh, thank you. You really helped me) But I think Gemini summarizes the video in general and doesn't make timestamps :(

@AntonMaltsev 1 month ago

@teodorchaly184 it definitely can. We once tested this in one project.

@vivekranjan2947 1 month ago

Hi Anton. I tried running my converted RKNN model on my RK3588 board, but I am getting an error that the .rknn model is either formatted wrong or corrupt. Can you please send me your converted model 'yolov8n.rknn' so that I can check my inference?

@AntonMaltsev 1 month ago

You need to convert with the matching version of the RKNPU SDK and rknn-toolkit, and export for the correct architecture. A random model will not help you.

@Alexnirvan77 1 month ago

Anton, have you tried to run inference on different NPU cores? (I'm having an issue with it: inference works only while using core 0.) Also, is there an option to run inference on all 3 cores?

@AntonMaltsev 1 month ago

I tested it on FriendlyElec 3588, Mekotronics 3576, Orange 3588 and Rock 3588. It worked successfully.

@littleyogi3341 1 month ago

What challenges can we solve with the RK3588?

@AntonMaltsev 1 month ago

all of them!

@r_ruslan 1 month ago

Thank you for sharing, you helped me a lot.

@andreyl2705 1 month ago

awesome)

@AlexanderDhoore 1 month ago

Anton, living in the future :)

@AntonMaltsev 1 month ago

@q-engineering 1 month ago

Two remarks. 1) In the absence of the GPU, the Rock5C-Lite is targeted for CLI applications. With the official Debian KDE-OS GUI, your CPU load can be up to 6x 85% when showing videos or RTSP streams on the screen. (Not too handy when it comes to vision tasks.) 2) The popular Ubuntu OS of Joshua-Riek doesn't support the RK3582 (yet).

@AntonMaltsev 1 month ago

Thank you, Rients! Yes, totally agree. CLI + no-streaming.

@aniketpatil4430 1 month ago

Where can I buy these? Can anyone please send some links to buy these products?

@harishs2651 1 month ago

How can the RK3582 (4 TOPS) be faster than the RK3588 (6 TOPS)?

@AntonMaltsev 1 month ago

1) It's not 4 TOPS, it's 5 TOPS according to the documentation - dl.radxa.com/rock5/5c/docs/hw/datasheet/Rockchip%20RK3582%20Datasheet%20V1.1-20230221.pdf 2) It's not only about NPU performance. It's about memory speed, frequency, etc. The result may differ for different networks / drivers / pip wheels.

@andreyl2705 1 month ago

awesome)

@flagcrew 1 month ago

WELL DONE
