Hye thanks for all these videos. But can you please make a video about mili csi virtual channel and how can we use them explaining everything required
@Andreas_Hopf17 сағат бұрын
What would be relevant for creators is if there were a way to feed in, say, six Midjourney images of the same object, showing it from four sides plus top and bottom, and then have a quality 3D mesh generated from that. So far, Midjourney and others fail to provide six views of the same object. Any ideas?
@antonrrtyq18 сағат бұрын
paligemma же умеет детектить
@r_ruslan18 сағат бұрын
@AntonMaltsev I have RK3576 2NPU and 8CPU and my camera fps is 30. I used that one but still not improved. Is it possible to combine cpu+npu? how?
@kuliev.vitaly21 сағат бұрын
internvl2 уже имела функционал детекции.
@ARMENIA181Күн бұрын
Thank you for video, can you please create video how to run Qwen2.5-VL on rockchip.
@AntonMaltsevКүн бұрын
It's a little bit complicated right now (Qwen required latest drivers, which are still not available for the main fork of the system). It's possible to do, but for now I need my board with different system (only one of my boards are capable to run qwen). So, definitely, at some point I will try to. But not now.
@goldennotchcreeper4838Күн бұрын
Very good video! Been watching you for a while now :)
@meirakrutkrut41433 күн бұрын
Здраствуйте, извиняюсь что не по теме видео, просто только недавно наткнулся на ваш канал. Я начал интересоваться комптютерным зрением. Эта область мне нравится. Однако заметил, что возможно, может быть лучше войти в эту область со смежной специальности, скажем, обычного ПО-разработчика и тд? Мне интересно, как лучше поступить, чтобы наиболее спокойно войти в эту область на рынке, не оставшись без работы. Просто я подумал, что пороговый вход тут сожет быть большим. Что вы можете посоветовать мне по этому поводу?
@PankajDoharey7 күн бұрын
LLM based AI is far from diagnosing my water filter issues. I need a water filter repairmen for that.
@vitaliy_dushepa7 күн бұрын
livebench ai benchmark is quite good.
@vitalyl13277 күн бұрын
AGI is already here. I'm not an influencer. I'm an engineer, working on agentic systems built on top of small (below 14B) LLMs. My systems are capable of general problem solving - simply by offloading all the reasoning to external tools (such as Prolog and SMT solvers). I do not believe we need LLMs to be able to reason on their own, we already have a much better and more precise reasoning engine.
@tempacc95897 күн бұрын
😂😂😂😂
@vitalyl13277 күн бұрын
@tempacc9589 what's so funny? Did you try to offload reasoning to external logical inference? It works way better than human reasoning, scales to any size of the problem.
@jiayojames7 күн бұрын
The issue is that "AGI" is such a badly defined concept. There's going to be people who refuse to acknowledge that something is an AGI unless it has a mind exactly like a human... which of course would be a totally illogical expectation for how AI will progress. Ultimately the only real way to measure it is in the accuracy, competence, and flexibility of task completion, regardless of whatever reductionist arguments the critics will level in regards to how it gets there.
@vitalyl13277 күн бұрын
@ it does not really matter what people think and how they define things. What matters is the tasks this system can perform autonomously. And if these tasks overlap with everything humans do and it can replace humans and do their work better and faster - it's and AGI. I'm only interested in replacing engineers, so don't really care about proving anything beyond this narrow niche.
@s1v78 күн бұрын
чувак, продумай заранее что хочешь сказать, мысль скачет туда-сюда - трудно понят что ты вообще хочешь сказать.
@fontenbleau8 күн бұрын
Yes, the solution is leaking into datasets, but deepseek v2,5 in my local offline test was the only one which repaired very unusual code, all others qwen or mistral Large can't repair that logical problem. I'm testing locally now V3 (next their r1) and it's quite good, the only i need to raise from Q5 to Q6 because quality lacks on low quants. But even on Q5 it's trying hard to write me a Bach symphony in sound programming language. I've noticed how it's self tuned from dumping the all code in every answer, which is sign of dumb models like qwen or mistral, but now it's writing only parts to edit or replace, which saves tokens.
@AntonMaltsev8 күн бұрын
Definitely, DeepSeek looks nice. But I am sure they used a lot of GPT generation for it, which is not a completely good solution.
@andreyl27058 күн бұрын
Anton, what do you think about AI camera "reCamera 2002" from seed studio?
@AntonMaltsev8 күн бұрын
It's the same SG2002 processor that Milk-V has. But software realization is much better. I expect something like this - kzbin.info/www/bejne/noTZZWxui92Cqqc But did not try it myself.
@thet0ast3r9 күн бұрын
how about performance?
@tot_ra9 күн бұрын
Произносится "Тексас" а не техас )
@AntonMaltsev9 күн бұрын
😁да, иногда клинит)
@thinkIndependent202411 күн бұрын
Did you use rhe AI Pro 20 TOPs with 24gb or 8gb 8TOPs
@AntonMaltsev11 күн бұрын
20TOPS version
@thinkIndependent202411 күн бұрын
@@AntonMaltsevinteresting no other SBC has 24gb with 20 TOPs and it performed below a PI 5 with 8gb and no NPU acceleration?
@than179412 күн бұрын
Доброго времени суток Какое время предсказания для этой платы ( Inference) для yolo? Если это есть в видео сложно услышать
@aiden_3c13 күн бұрын
It might be worth looking into this board again! The documentation is a lot better, but still isn't the best... Community documentation being split among two websites is the biggest thing for me. I've been able to get some amazing stuff working, like running small LLMs, Arm and RiscV concurrency, etc. Cross compiling is also a lot easier, still needs a little setup but now I can just run a custom GCC command and use it just like normal GCC. Both for building custom images, and for building code to run on the RiscV processor. Planning on getting Yolo running on here too for some drone footage processing, found this video while trying to find _anyone_ who's posted a video of its performance
@AntonMaltsev13 күн бұрын
Nice idea. Thanks for catching this. Definitely not now, but it may be a good idea to take some second sophron-based board and check the global progress for the all ecosystem.
@elenikatsioli536014 күн бұрын
AI is applied effectively in natural language, performs even better in coding, and excels further in games. A finite set of human rules enables refined statistics and probabilistic methods to function with reasonable accuracy. Reality is philosophically deterministic in terms of causality, but not in a strictly mathematical sense. The challenge for AIs in dealing with reality lies in its non-deterministic nature, a given state does not always lead to the same outcome. There is a lack of abstraction and symbolism, persistent issues in current models that cannot be solved merely through scaling or refinement.
@saltyboiproductions15 күн бұрын
Good video, hard to follow your speaking pattern. Feel free to shoot multiple takes man. Especially for a 3 minute video. Recommend getting a better microphone for these things or moving closer to your microphone
@QW3RTYUU14 күн бұрын
It’s a common accent online. Not everyone is US-born-and-raised.
@saltyboiproductions14 күн бұрын
@QW3RTYUU I said pattern of speaking, not their accent. Probably preventable with a script and multiple takes, as I also mentioned. Bold of you to assume I'm US born and raised lol.
@ContentSafe15 күн бұрын
One of the few AI related videos not overly dramatic or critical, just showing the current limits. i like it
@randomchannel-px6ho15 күн бұрын
The real ones know computer scientist promised "AGI anyday now" in 1956... 1956! This isn't a dystopia...
@usamazaheer310916 күн бұрын
Zed2i is the best
@fontenbleau16 күн бұрын
I'm interested for now in videogen models, text-video is always random, but image-video is more interesting. LTXVideo almost there, but finding perfect setting takes hours (in ComfyUi), Ruyi kinda works but only short 5 sec gifs and they can't make it faster (40 minutes to generate on 4070 Ti Su), Hunyan praised as best chinese video gen but i haven't tried images and it's most demanding in processing power & VRAM.
@afarsek_9116 күн бұрын
Может быть имеет смысл сделать два отдельных канала, один на русском, другой на английском? 😶
@AntonMaltsev16 күн бұрын
Этот на английском;)
@AntonMaltsev16 күн бұрын
На русском есть, например, такой - www.youtube.com/@ZlodeiBaal
@AmiprojectDotCom18 күн бұрын
Спасибо! Жду новых выпусков!
@dnogin20 күн бұрын
Крутяк, спасибо! Послушал от корки до корки с интересом!
@romarsit179523 күн бұрын
AI is overrated
@AntonMaltsev23 күн бұрын
NO WAY!!!11!
@haithemsekri661226 күн бұрын
Thank you man for the great tuto 🙏
@ShopperPlug27 күн бұрын
This is true, everyone is stating that Yolo11 isn't a big deal. I'm performing many tests and will conclude how better it is. I have a RTX 3080 TI. There is only one reason why I'm working with YOLO11, it has extremely well documentation on how to get things running.
@Alienvlg28 күн бұрын
Огонь !
@КириллЧе-я5ы28 күн бұрын
Интересно, а обработка сигнала с выделением каких-то паттернов и признаков тоже вероятно область lvm?.. и стоит ли применять существующие большие модели для этого, скажем запихнув эти модели куда нибудь в пайплайн обёртку вроде langchain? Или мб свою попробовать лучше запилить, если есть датасеты?..
@AntonMaltsev28 күн бұрын
Ну, если есть большой датасет и признак детерминированный то лучше что-то обучать. langchain и прочее - это все же про логику принятия решений, когда надо много с пространством знания увязывать.
@КириллЧе-я5ы28 күн бұрын
@ спасибо!
@КириллЧе-я5ы28 күн бұрын
Народ, с Новым годом!! Вы в Београде, судя по крышам?..☺️
@AntonMaltsev28 күн бұрын
Не, это на моей кухне, просто вид в другую сторону чем обычно. Все так же в Норвегии.
@СергейОсташко-у7б29 күн бұрын
Хорошо говориш на русском. Как будто родно ;))
@positivenozy606528 күн бұрын
А ты плохо "говориш". Чего-то тебе родно?)
@youknowwhatlol662816 күн бұрын
@@positivenozy6065не рідна у нього ця мова, це ж очевидно.
@sashulya-suАй бұрын
Очень интересно!
@PRiKoL1ST1Ай бұрын
Все таки лучше с микрофоном
@fontenbleauАй бұрын
Что вы думаете про китайский Проект Генезис?
@AntonMaltsevАй бұрын
Этот - github.com/Genesis-Embodied-AI/Genesis ? Я скептически отношусь к всякой генерации, если данные можно проще достать. Обычно если надо генерировать это x3 сложность проекта
@fontenbleauАй бұрын
@AntonMaltsev спасибо ✍️🤝
@teodorchaly184Ай бұрын
Interesting video! I watch almost every video on your chanel. I wanted to ask for some advice: I’m planning to create a KZbin channel and wanted to know how to create timestamps for videos. Is it done manually or is there a way to automate the process?
@AntonMaltsevАй бұрын
Hi, Teodor! I am doing this myself, but I think Gemini can easily do this - cloud.google.com/vertex-ai/generative-ai/docs/samples/generativeaionvertexai-gemini-video-with-audio
@teodorchaly184Ай бұрын
@@AntonMaltsev O, thank you. You really helped me) But i think gemini summarize video in general, but didn t make timestempts:(
@AntonMaltsevАй бұрын
@@teodorchaly184 it definitely can. We once tested this in one project.
@vivekranjan2947Ай бұрын
Hi Anton. I tried running my converted RKNN model on my rk3588 board but am getting an error that the .RKNN model is either formatted wrong or is or corrupt. Can you plz send me your converted model ' yolov8n.rknn' so that I can check my inference.
@AntonMaltsevАй бұрын
You need convert on corresponding version of RKNPU SDK, rknn-toolkit + export for correct architecture. A random model will not help you.
@Alexnirvan77Ай бұрын
Anton, have you tried to run inference on different NPU cores? (Im having an issue with it, inference works only while using core 0). Also, is there an option to run inference on all 3 cores?
@AntonMaltsevАй бұрын
On Friendly Elec 3588, Mekotronics 3576, Orange 3588 and Rock 3588 I tested it. It was sucessfully worked.
@littleyogi3341Ай бұрын
what challanges we can achive with rk3588??
@AntonMaltsevАй бұрын
all of them!
@r_ruslanАй бұрын
Thank you for sharing you helped me a lot
@andreyl2705Ай бұрын
awesome)
@AlexanderDhooreАй бұрын
Anton, living in the future :)
@AntonMaltsevАй бұрын
:D
@q-engineeringАй бұрын
Two remarks. 1) In the absence of the GPU, the Rock5C-Lite is targeted for CLI applications. With the official Debian KDE-OS GUI, your CPU load can be up to 6x 85% when showing videos or RSTP streams on the screen. (Not too handy when it comes to vision tasks) 2) The popular Ubuntu OS of Johua-Riek doesn't support the RK3582 (yet).
where to buy can anyone please send some links to buy theser products
@harishs2651Ай бұрын
how can rk3582(4 tops) is faster than rk3588(6 tops)?
@AntonMaltsevАй бұрын
1) It's not 4TOPS, it's 5TOPS according to the documentation - dl.radxa.com/rock5/5c/docs/hw/datasheet/Rockchip%20RK3582%20Datasheet%20V1.1-20230221.pdf 2) It's not only about NPU performance. It's about memory speed, frequency, etc. The result may differ for different nerworks / drivers/pip wheels.