AI "Behavior Cloning" for fast Residual RL (MIT, Harvard)

Views: 1,682

Discover AI

A day ago

Latest AI research on imitation learning (IL), with a focus on behavioral cloning of a student policy from a teacher AI system, and on sim-to-real transfer.
Imagine a future of space exploration in which robots autonomously build habitats for humans on Mars. The interplay of behavior cloning and reinforcement learning is a step toward that vision. With behavioral cloning, an agent learns from human experts to perform precise tasks, such as assembling a Martian shelter. The approach combines the stability of a pre-trained model with the adaptability of reinforcement learning to refine the robot's actions. This hybrid method is meant to help robots handle the unpredictable Martian environment and offers a pragmatic route to autonomous construction with limited computational resources.
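As a rough illustration of the behavior cloning step described above, the sketch below treats imitation as plain supervised regression from states to expert actions. Everything in it is an assumption made for illustration: the 1-D state, the linear policy, the toy demonstration data, and the name `fit_bc_policy` are hypothetical and are not taken from the paper or the video.

```python
from statistics import mean


def fit_bc_policy(states, actions):
    """Fit a 1-D linear policy a = w * s + b to expert (state, action)
    pairs by ordinary least squares. Returns the policy as a function."""
    s_mean, a_mean = mean(states), mean(actions)
    cov = sum((s - s_mean) * (a - a_mean) for s, a in zip(states, actions))
    var = sum((s - s_mean) ** 2 for s in states)
    w = cov / var
    b = a_mean - w * s_mean
    return lambda s: w * s + b


# Hypothetical expert demonstrations: the expert moves toward a target at 1.0
# with an action proportional to the remaining distance.
demo_states = [0.0, 0.25, 0.5, 0.75, 1.0]
demo_actions = [0.5 * (1.0 - s) for s in demo_states]

# The cloned policy reproduces the expert's behavior on the demonstrated states.
bc_policy = fit_bc_policy(demo_states, demo_actions)
```

A real system would replace the linear fit with a neural network trained on image observations, but the training signal is the same: match the demonstrated action for each observed state.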
The innovation lies in how the AI techniques are layered. First, a behavior cloning model learns from human demonstrations and forms a base policy. Reinforcement learning then refines this base with small but critical corrections, improving the robot's performance without destabilizing the original model. Large amounts of synthetic data generated in simulated environments make the system more robust and help bridge the gap between controlled simulations and dynamic real-world conditions on Mars. The method improves the robot's precision and also reduces its operational complexity, which makes it feasible to run on limited hardware anywhere, including on Mars and on satellites of the outer solar system.
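The layering described above can be sketched as a frozen base policy plus a small, bounded correction. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the clipping range, the `scale` value, and both toy policies are hypothetical.

```python
def clip(x, lo, hi):
    """Clamp x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def residual_action(base_policy, residual_policy, state, scale=0.1):
    """Combine a frozen base action with a bounded residual correction.

    The residual output is clipped to [-1, 1] and multiplied by a small
    scale (an assumed value), so the correction can only nudge the base
    action and cannot override it.
    """
    base = base_policy(state)                        # frozen, behavior-cloned action
    delta = clip(residual_policy(state), -1.0, 1.0)  # bounded correction
    return base + scale * delta


# Hypothetical stand-ins: a cloned base policy and a learned correction.
base_policy = lambda s: 0.5 * (1.0 - s)
residual_policy = lambda s: 3.0 * (1.0 - s)

# At state 0.0 the residual saturates at 1.0, so the action is 0.5 + 0.1 * 1.0.
action = residual_action(base_policy, residual_policy, 0.0)
```

Only the residual policy would be updated during reinforcement learning in this setup; keeping the base policy fixed and the correction small is what limits the risk of destabilizing the original behavior.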
All rights with the authors:
From Imitation to Refinement - Residual RL for Precise Visual Assembly
arxiv.org/pdf/...
#airesearch
#newtechnology
#robotics

Comments: 2

@MusingsAndIdeas 6 months ago

This is disturbingly similar to children learning basic tasks, and then getting better with practice

@Hshjshshjsj72727 2 months ago

Yes whats wrong with that 😂
