Best of 2024 in Agents (from #1 on SWE-Bench Full, Prof. Graham Neubig of OpenHands/AllHands)

  Рет қаралды 1,917

Latent Space

Latent Space

Күн бұрын

Пікірлер: 11
@LatentSpaceTV
@LatentSpaceTV 19 сағат бұрын
links to slides and more context x.com/latentspacepod/status/1871998012467380698
@RafiDude
@RafiDude 16 сағат бұрын
Please pin this comment.
@Lucien-lu1vw
@Lucien-lu1vw 17 сағат бұрын
Amazing demo with the pie charts PR. I played a lot with agents and came to the same conclusions and design choices as Graham Neubig.
@TheNitroPython
@TheNitroPython 12 сағат бұрын
These talks are extremely valuable. I would love to be in person.
@NaveenReddy-p5j
@NaveenReddy-p5j 17 сағат бұрын
Graham Neubig nailed it! Agent tech’s evolving fast, with AllHands on top. 2025 looks exciting for agents!
@AlexJohnson-g4n
@AlexJohnson-g4n 15 сағат бұрын
AllHands leading SWE-Bench Full is impressive! Graham Neubig’s insights were top-notch. 2025 does seem like the year for major agent tech breakthroughs.
@neuronwave
@neuronwave 11 сағат бұрын
Interesting to watch this at Christmas and reflect on how much higher o1-pro (49) and o3 (71.7) are than the performance of models on swe-bench example. Clearly highlights a challenge in 2025 (and beyond) to build things that won't be washed away in the tsunami of new models doing all the agent workbench internals. Especially relevant as the cost per hour of software dev is so high that o1-pro (and probably o3), while expensive, are much cheaper than human coder.
@MatthewSanders-l7k
@MatthewSanders-l7k 17 сағат бұрын
Graham Neubig's keynote on LLM agents is a game-changer! AllHands leading SWE-Bench Full is impressive. Exciting future for agent tech!
@WinonaNagy
@WinonaNagy 17 сағат бұрын
Neubig’s insights are a gem. With AI agents advancing rapidly, are we ready for a paradigm shift in our digital interactions?
@AlternativeTakes
@AlternativeTakes Сағат бұрын
can we stop ai generated answers
@Aedonius
@Aedonius 12 сағат бұрын
Cursor IDE New YOLO MODE is insane.
Windsurf: The Enterprise AI IDE
1:06:36
Latent Space
Рет қаралды 1,5 М.
99.9% IMPOSSIBLE
00:24
STORROR
Рет қаралды 31 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН
Best of 2024 in Vision [LS Live @ NeurIPS]
55:46
Latent Space
Рет қаралды 474
0 to over $8M ARR in 2 months as a Claude Wrapper (Bolt.new, Qodo)
1:36:42
Interview of Arthur Gretton ML Researcher at Google DeepMind
18:14
ML New Papers
Рет қаралды 4,4 М.
SpaceX's shows new Starship Heat Tiles! And they're red!
19:56
What about it!?
Рет қаралды 418 М.
Agents @ Work: Lindy.ai (with live demo!)
1:08:01
Latent Space
Рет қаралды 3,1 М.
Best of 2024: Open Models [LS LIVE! at NeurIPS 2024]
37:29
Latent Space
Рет қаралды 1,5 М.
The State of AI Startups in 2024 [LS Live @ NeurIPS]
26:35
Latent Space
Рет қаралды 1,8 М.
99.9% IMPOSSIBLE
00:24
STORROR
Рет қаралды 31 МЛН