Create Your "Small" Action Model with GPT-4o

  Рет қаралды 5,976

All About AI

All About AI

Күн бұрын

Create Your "Small" Action Model with GPT-4o
👊 Become a member and get access to GitHub and Code:
/ allaboutai
🤖 Great AI Engineer Course:
scrimba.com/learn/aiengineer?...
🔥 Open GitHub Repos:
github.com/AllAboutAI-YT/easy...
📧 Join the newsletter:
www.allabtai.com/newsletter/
🌐 My website:
www.allabtai.com
I try to create my own "small" action model based on Python and the GPT-4o API. Will it work? Lets find out
00:00 Small Action Model GPT-4o Intro
01:48 GPT-4o Action Model Code
05:54 Testing the Model

Пікірлер: 26
@ShpanMan
@ShpanMan 28 күн бұрын
This is actually really impressive. GPT-4o watching you act and understands what is done, then writes code to reproduce it, which can then be run and automated. Very clever flow, OpenAI should definitely hire you.
@MilkGlue-xg5vj
@MilkGlue-xg5vj 27 күн бұрын
Anyone can do better than this with a powerful language model, it's not much. It's just that the rabbit is overrated.
@georgestander2682
@georgestander2682 28 күн бұрын
Thanks, this is interesting. I was wondering about this as well and had a thought about adding log data of user interactions to give the model more telemetry. So it not just vision but also the actual logs of all the interactions happening in the background.
@clumsy_en
@clumsy_en 27 күн бұрын
Cool experimental project and idea 👍 The entire process can be scripted further to continuously store the most recent number of screenshots in 2-second intervals to VRAM using PyTensor, and a call can be triggered at any time with keyword through mic input or keys shortcut to send it to gpt-4o to retrieve the "reply last action script" and then automatically execute it to save time doing some mundane tasks👍👍
@mikew2883
@mikew2883 28 күн бұрын
This is awesome!
@nic-ori
@nic-ori 28 күн бұрын
Useful information. Thank you!👍👍👍
@cyc00000
@cyc00000 27 күн бұрын
So good to see you getting onboard the rabid r1. It's seriously going to change lives.Enjoyed the video man.
@TTOnkeys
@TTOnkeys 26 күн бұрын
I can think of so many uses for this. Great work.
@user-yw9us2qo6g
@user-yw9us2qo6g 28 күн бұрын
Looks great
@gnosisdg8497
@gnosisdg8497 28 күн бұрын
so where is the code for this project! looks fun
@Soft_Touch_
@Soft_Touch_ 28 күн бұрын
I've been thinking recall and omni screenshots were ways to create large pratical data sets to train lams. Do you think that is what's happening? You seem to be doing a smaller version of this
@NetHyTech
@NetHyTech 28 күн бұрын
Bro Plz create video for real time vision and response
@lokeshart3340
@lokeshart3340 28 күн бұрын
Woh woh look whos here bhai kya aap mere ko jante ho ya yaad rkhe ho?
@BThunder30
@BThunder30 27 күн бұрын
Interesting project as always.
@ibrahimaba8966
@ibrahimaba8966 27 күн бұрын
Very interesting. I think it could also be useful to provide it with the mouse positions between different frames. To go further, we could create multiple actions and then implement a RAG that allows the model to choose the correct snapshot and execute it. Thanks for this video.
@ewasteredux
@ewasteredux 28 күн бұрын
Are there any local LLM's this might work with?
@PanduPandu-fh5tk
@PanduPandu-fh5tk 28 күн бұрын
Maybe, LLaVA 13b can
@carstenli
@carstenli 26 күн бұрын
Great start. What's the GH url for subscribers?
@futureworldhealing
@futureworldhealing 28 күн бұрын
learning how to be data scientist 80% from u bro haha
@darthvader4899
@darthvader4899 27 күн бұрын
How does it know where to click though? Does
@kalilinux8682
@kalilinux8682 28 күн бұрын
Humane and Rabbit watching this and raising another round of funding
@avi7278
@avi7278 28 күн бұрын
honestly more legit than scammer Jesse Lyu and RabbitR1 garbage hardware scam after his NFT game scam.
@lokeshart3340
@lokeshart3340 28 күн бұрын
Hello sir can u recreate gemini vision fake demo in real life
@JNET_Reloaded
@JNET_Reloaded 28 күн бұрын
the github is always the same repo btw itl be easyer tomake a new repo for each project and put project link in description
@wurstelei1356
@wurstelei1356 27 күн бұрын
I think you can link to git sub folders. The repo is pretty messy, but keep in mind, this is free. Thou I am also not able to find code for some projects on that repo.
@spencerfunk6697
@spencerfunk6697 28 күн бұрын
So literally open interpreter…
26 Incredible Use Cases for the New GPT-4o
21:58
The AI Advantage
Рет қаралды 724 М.
UFC Vegas 93 : Алмабаев VS Джонсон
02:01
Setanta Sports UFC
Рет қаралды 204 М.
Неприятная Встреча На Мосту - Полярная звезда #shorts
00:59
Полярная звезда - Kuzey Yıldızı
Рет қаралды 3,2 МЛН
⬅️🤔➡️
00:31
Celine Dept
Рет қаралды 38 МЛН
Two GPT-4os interacting and singing
5:55
OpenAI
Рет қаралды 2,8 МЛН
GPT-4o is WAY More Powerful than Open AI is Telling us...
28:18
MattVidPro AI
Рет қаралды 256 М.
Don’t Build AI Products The Way Everyone Else Is Doing It
12:52
Steve (Builder.io)
Рет қаралды 339 М.
You’re using ChatGPT wrong
9:31
Jeff Su
Рет қаралды 340 М.
Humanity Is Not Ready For These AI Voice Conversations.
10:01
It's Jonny Keeley
Рет қаралды 55 М.
Mapping GPT revealed something strange...
1:09:14
Machine Learning Street Talk
Рет қаралды 197 М.
Mind-maps and Flowcharts in ChatGPT! (Insane Results)
13:05
AI Foundations
Рет қаралды 302 М.
Дени против умной колонки😁
0:40
Deni & Mani
Рет қаралды 13 МЛН
Mi primera placa con dios
0:12
Eyal mewing
Рет қаралды 719 М.
Cadiz smart lock official account unlocks the aesthetics of returning home
0:30
How To Unlock Your iphone With Your Voice
0:34
요루퐁 yorupong
Рет қаралды 25 МЛН