Learning Robot Control: From RL to Differential Simulation - (PhD Defense of Yunlong Song)

  Рет қаралды 4,138

UZH Robotics and Perception Group

UZH Robotics and Perception Group

Күн бұрын

This thesis focuses on Learning Robot Control by integrating deep reinforcement learning (RL) and model-based control methods. It aims to develop advanced control methods that bridge the gap between data-driven learning and model-based control. The proposed methods enhance robot agility and robustness in real-world applications.
Key contributions are:
- Show that RL outperforms Optimal Control in autonomous racing because it directly optimizes a non-differentiable task-level objective.
- Propose a policy-search-for-model-predictive-control (MPC) framework, combining RL's ability to optimize high-level task objectives with MPC's precise actuation and constraint handling.
- Introduce a differentiable simulation framework to leverage robot dynamics for more stable and - efficient policy training.
- Develop a high-performance drone racing system outperforming optimal control methods and professional pilots.
- Develop Flightmare, a flexible modular quadrotor simulator for reinforcement learning and vision-based flight.
OUTLINE:
00:00 - Introduction
02:37 - Robot Control: An Optimal Control Perspective
03:14 - Robot Control: A Reinforcement Learning Perspective
05:06 - Project 1: Autonomous Drone Racing: Optimal Control vs. Reinforcement Learning
12:05 - Project 2: Flying Through Dynamic Gates: Reinforcement Learning for Optimal Control
16:04 - Project 3: Quadrupedal Locomotion: Differentiable Simulation
20:18 - Conclusions
23:05 - One More Thing

Пікірлер: 12
@zebinhuang6352
@zebinhuang6352 6 күн бұрын
Congratulations
@xiaotiandai
@xiaotiandai 4 күн бұрын
Good job
@leihe7188
@leihe7188 Ай бұрын
Congratulations! Very nice work!
@kehanlong648
@kehanlong648 28 күн бұрын
Amazing work!
@eeshiba8505
@eeshiba8505 Ай бұрын
Nice work! Congrats!
@user-kt9kz5pe7w
@user-kt9kz5pe7w Ай бұрын
amazing work!
@yumaoliu5319
@yumaoliu5319 Ай бұрын
Congratulation! Dr. Song.
@ashfaquekhan7282
@ashfaquekhan7282 26 күн бұрын
amazing work Dr. Song
@ChanJoon
@ChanJoon 29 күн бұрын
Congratulation! Very impressive! I also want to be a researcher like you!
@user-hw1ge9df9k
@user-hw1ge9df9k Ай бұрын
Very impressive and excellent work! Congratulation! Dr. Song. May I ask you a simple question? How do you train the RL NN so that the drone know the order of gate to pass? 1st gate, 2nd gate, .. , how the drone trained to know the sequence of gates to pass? Is there any mark on the gate corner so that the sequence can be visually recognized?
@QiWei_cs
@QiWei_cs Ай бұрын
Congrats! BTW, can you share your PhD thesis?
@xiaodaochen
@xiaodaochen Ай бұрын
Congratulations
Human-Level Performance with Autonomous Vision-based Drones - Davide Scaramuzza
1:26:23
UZH Robotics and Perception Group
Рет қаралды 11 М.
PID vs. Other Control Methods: What's the Best Choice
10:33
RealPars
Рет қаралды 115 М.
ПРОВЕРИЛ АРБУЗЫ #shorts
00:34
Паша Осадчий
Рет қаралды 7 МЛН
لااا! هذه البرتقالة مزعجة جدًا #قصير
00:15
One More Arabic
Рет қаралды 14 МЛН
Mom's Unique Approach to Teaching Kids Hygiene #shorts
00:16
Fabiosa Stories
Рет қаралды 38 МЛН
The Clever Way to Count Tanks - Numberphile
16:45
Numberphile
Рет қаралды 700 М.
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 258 М.
The Greenwich Meridian is in the wrong place
25:07
Stand-up Maths
Рет қаралды 420 М.
How to train simple AIs to balance a double pendulum
24:59
Pezzza's Work
Рет қаралды 213 М.
Joseph Suarez Thesis Defense - Neural MMO
1:00:06
Neural MMO
Рет қаралды 126 М.
Microsoft Is KILLING Windows | ft. Steve @GamersNexus
19:19
Level1Techs
Рет қаралды 289 М.
iPhone 15 Pro в реальной жизни
24:07
HUDAKOV
Рет қаралды 498 М.
Проверил, как вам?
0:58
Коннор
Рет қаралды 363 М.
Ba Travel Smart Phone Charger
0:42
Tech Official
Рет қаралды 1,2 МЛН
Tag him😳💕 #miniphone #iphone #samsung #smartphone #fy
0:11
Pockify™
Рет қаралды 4,5 МЛН
My iPhone 15 pro max 😱🫣😂
0:21
Nadir Show
Рет қаралды 1,8 МЛН