Testing Frontier LLMs (GPT4) on ARC-AGI

  Рет қаралды 3,769

ARC Prize

ARC Prize

Күн бұрын

Template: www.kaggle.com...
arcprize.org/l...
arcprize.org/a...
ARC Prize is a $1,000,000+ public competition to beat and open source a solution to the ARC-AGI benchmark.
Hosted by Mike Knoop (Co-founder, Zapier) and François Chollet (Creator of ARC-AGI, Keras).
--
Website: arcprize.org/
Twitter/X: / arcprize
Newsletter: Signup @ arcprize.org/
Discord: / discord
Try your first ARC-AGI tasks: arcprize.org/play

Пікірлер: 19
@conformist
@conformist 3 ай бұрын
first.
@cyb3rvoid
@cyb3rvoid 3 ай бұрын
That was unreal!
@conformist
@conformist 3 ай бұрын
@@cyb3rvoid for my next magic trick, i will solve the agi price first
@wwkk4964
@wwkk4964 3 ай бұрын
​@@conformistsolve it backwards!
@filipgara3444
@filipgara3444 3 ай бұрын
Ensure diversity in your model
@MarkoTManninen
@MarkoTManninen 3 ай бұрын
I understand retries, but I am confuced with the two attempts. Do you always need to provide two? In which case they would have different data and both would be required for 100% correct prediction? I also missed the part in which the prediction and correct answers are matched and prounounced.
@ARCprize
@ARCprize 3 ай бұрын
Sorry this isn't more clear on the video! You get two tried at each task. Old competitions had 3 tries. So you can basically give two attempts. If either are correct you pass the task. Under scoring methodology there is more information: arcprize.org/guide#submissions
@LimeTubeH
@LimeTubeH 3 ай бұрын
I'm confused...what are we supposed to attach with our API add-on secret?
@ARCprize
@ARCprize 3 ай бұрын
What do you mean attach? That’s where you put your API key and then reference it in your code
@duncansmothers
@duncansmothers Ай бұрын
this is really helpful/thoughtful just joining the competition and this is an exceptional resource.
@ARCprize
@ARCprize Ай бұрын
Awesome! Glad to hear it. Let us know if you have any questions Duncan
@johnkintner
@johnkintner 2 ай бұрын
third since no one called it :kappa:
@aluphshahim5808
@aluphshahim5808 3 ай бұрын
Second 😂
@jackq2331
@jackq2331 2 ай бұрын
Excellent.
@ARCprize
@ARCprize 2 ай бұрын
Thank you!
@sp3ct3rgaming46
@sp3ct3rgaming46 2 ай бұрын
i might be tripping but i think this dude cloned his own voice and then layered it into the video. you can hear the typical elevenlabs lisp
@ARCprize
@ARCprize 2 ай бұрын
@@sp3ct3rgaming46 you’re tripping. I did the video and no voice dub used
How ChatGPT Built My App in Minutes 🤯
8:28
Website Learners
Рет қаралды 2,5 МЛН
Kubernetes 101 workshop - complete hands-on
3:56:03
Kubesimplify
Рет қаралды 1,6 МЛН
Nastya and balloon challenge
00:23
Nastya
Рет қаралды 68 МЛН
100 Identical Twins Fight For $250,000
35:40
MrBeast
Рет қаралды 54 МЛН
GIANT Gummy Worm Pt.6 #shorts
00:46
Mr DegrEE
Рет қаралды 99 МЛН
Officer Rabbit is so bad. He made Luffy deaf. #funny #supersiblings #comedy
00:18
Funny superhero siblings
Рет қаралды 12 МЛН
General Intelligence: Define it, measure it, build it
53:41
ARC Prize
Рет қаралды 8 М.
xAI introduces Grok-2 | Stronger than Claude 3.5 Sonnet!? (Tested)
31:13
22-Year-Old Immigrant Made $700K in 3 Months with AI
16:37
AppSumo
Рет қаралды 337 М.
Could AI solve this puzzle? (ARC-Game)
18:42
Yannic Out Of Distribution
Рет қаралды 4,4 М.
How Might We Learn?
55:29
Andy Matuschak
Рет қаралды 2,1 М.
Explore ARC-AGI Data + Play
11:03
ARC Prize
Рет қаралды 7 М.
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 994 М.
Unlimited AI Agents running locally with Ollama & AnythingLLM
15:21
Tim Carambat
Рет қаралды 136 М.
Why Agent Frameworks Will Fail (and what to use instead)
19:21
Dave Ebbelaar
Рет қаралды 65 М.
Nastya and balloon challenge
00:23
Nastya
Рет қаралды 68 МЛН