4 HARD Challenges for Claude Computer Use: Very Promising Results for AI Agents!

6,323 views

All About AI

1 day ago

Comments: 24
@Monuccestduletpou 2 days ago
What I would personally like to see is the AIs tested on real professional use cases, using the same long, detailed prompts. The mail test seems the closest thing to that, but I had in mind things like a full website, a tiny e-commerce store, a food delivery app, or a setup of multiple Docker images orchestrated with Swarm or Kubernetes… Benchmarks with standardized prompts like a to-do app or a Tetris game are interesting for comparing theoretical intelligence, but I can't help thinking that, beyond benchmarking for its own sake, the use cases we generally see (not only in this video) are not meant for the professional world. But that's just my personal opinion.
@Phonognomiks 1 day ago
Exactly
@andreinikiforov2671 3 days ago
You are doing cutting-edge content, as always! By the way, 93.7% is 123 IQ points, pretty good!
@John-il4mp 2 days ago
The real number is 127 ;) Even better.
@John-il4mp 2 days ago
Being in the top 6.3% corresponds to about the 93.7th percentile. Using a normal distribution table or calculator, this percentile roughly aligns with an IQ score of 127.
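The percentile-to-IQ conversion discussed in this thread can be checked with a short sketch. It assumes IQ is modeled as a normal distribution with mean 100 and a standard deviation of 15 (the most common convention; some scales use 16). The function name `percentile_to_iq` is made up for illustration.

```python
from statistics import NormalDist


def percentile_to_iq(percentile: float, mean: float = 100.0, sd: float = 15.0) -> float:
    """Map a percentile (strictly between 0 and 100) to a score on a normal IQ scale.

    Assumption: scores follow a normal distribution with the given mean and sd.
    """
    # inv_cdf returns the z-score below which the given fraction of the population falls.
    z = NormalDist().inv_cdf(percentile / 100.0)
    return mean + sd * z


if __name__ == "__main__":
    # Top 6.3% corresponds to the 93.7th percentile.
    for sd in (15.0, 16.0):
        print(f"SD {sd:g}: IQ is about {percentile_to_iq(93.7, sd=sd):.0f}")
```

Under the SD 15 assumption this comes out at roughly 123 rather than 127. A scale with a larger standard deviation gives a higher number, so the two figures quoted in the comments may simply reflect different conventions.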
@nathank5140 2 days ago
Amazing. Love to see someone showing what’s possible.
@herramientak 2 days ago
What was the total price of the four tests?
@rjackstheartofwealth6152 17 hours ago
How much did it cost?
@lyeln 3 days ago
"AI has no owner" Jokes aside, impressive quality. Thank you for sharing this experiment!
@sirrobinofloxley7156 3 days ago
Amazing stuff, really nailed it there, though I'm surprised Firefox doesn't have a dark theme?
@godonholiday 2 days ago
You should test whether it can pass the Google "not a robot" security tests, where you have to select all the pictures of cars, etc.
@derrelecte 1 day ago
I am trying to experiment with Claude using your tutorials. I want Claude to create a video snippet for me. I'm trying to get Claude to download and install Lightworks, but I'm running into tons of issues. Do you have any advice?
@JNET_Reloaded 2 days ago
You didn't show the cost at the end :/
@fearhand 3 days ago
Couldn't something like Make or Zapier do this more efficiently through API calls? Or the agent itself could use API calls instead of web-based GUI interactions.
@Mookummockup 3 days ago
Yes, but you don't have to deal with as many syntax issues this way. It's probably less efficient if Zapier etc. can handle it, but way more flexible.
@Newsinrealestate 2 days ago
Are you using something added on to Claude?
@Phonognomiks 1 day ago
“Computer Use”?
@JNET_Reloaded 2 days ago
Good job it's not Outlook, it would be doing spammy stuff rn lol
@dewijones92 3 days ago
awesome
@noviceartisan 2 days ago
That percentile number = an IQ of 123
@noviceartisan 2 days ago
For your next challenge, use Onshape (a browser-based 3D modelling program) to create a 3D model of some complexity ;)
@ElishaGefen-t5w 3 days ago
First