Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence


Pupusse LINCS


Speaker: Alexandre Pham
Paper by Philip Jordan, Florian Grötschla, Flint Xiaofeng Fan, and Roger Wattenhofer.
Abstract:
In Federated Reinforcement Learning (FRL), agents aim to collaboratively learn a common task, while each agent is acting in its local environment without exchanging raw trajectories. Existing approaches for FRL either (a) do not provide any fault-tolerance guarantees (against misbehaving agents), or (b) rely on a trusted central agent (a single point of failure) for aggregating updates. We provide the first decentralized Byzantine fault-tolerant FRL method. Towards this end, we first propose a new centralized Byzantine fault-tolerant policy gradient (PG) algorithm that improves over existing methods by relying only on assumptions standard for non-fault-tolerant PG. Then, as our main contribution, we show how a combination of robust aggregation and Byzantine-resilient agreement methods can be leveraged in order to eliminate the need for a trusted central entity. Since our results represent the first sample complexity analysis for Byzantine fault-tolerant decentralized federated non-convex optimization, our technical contributions may be of independent interest. Finally, we corroborate our theoretical results experimentally for common RL environments, demonstrating the speed-up of decentralized federations w.r.t. the number of participating agents and resilience against various Byzantine attacks.
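To make the "robust aggregation" step concrete, below is a minimal sketch in Python. It is not the paper's actual aggregation rule (which this description does not specify); it assumes a coordinate-wise trimmed mean, a standard Byzantine-robust aggregator, applied to the policy gradient estimates collected from n agents of which at most f may be Byzantine. The function name trimmed_mean and the toy setup are illustrative only.

```python
# Sketch of Byzantine-robust gradient aggregation (assumed rule: coordinate-wise
# trimmed mean). Illustrative only; not the paper's specific method.
import numpy as np

def trimmed_mean(gradients: np.ndarray, f: int) -> np.ndarray:
    """Coordinate-wise trimmed mean over n gradient vectors of shape (n, d).

    For each coordinate, discard the f smallest and f largest values,
    then average the remaining n - 2f. Tolerates up to f Byzantine
    agents provided n > 2f.
    """
    n, _ = gradients.shape
    assert n > 2 * f, "need n > 2f for the trimmed mean to be well-defined"
    sorted_grads = np.sort(gradients, axis=0)  # sort each coordinate independently
    kept = sorted_grads[f:n - f]               # trim f extremes on each side
    return kept.mean(axis=0)

# Toy usage: 7 agents, 2 Byzantine, 3-dimensional policy gradients.
rng = np.random.default_rng(0)
honest = rng.normal(loc=1.0, scale=0.1, size=(5, 3))  # honest estimates near the true gradient
byzantine = np.full((2, 3), 1e6)                      # adversarial outlier updates
print(trimmed_mean(np.vstack([honest, byzantine]), f=2))  # stays close to [1, 1, 1]
```

The key property this illustrates: because each coordinate drops the f most extreme values, arbitrarily corrupted updates from up to f agents cannot pull the aggregate far from the honest agents' gradients. The decentralized setting in the paper additionally requires a Byzantine-resilient agreement step so that all honest agents adopt the same aggregate without a trusted coordinator.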
Check out our website : www.lincs.fr/
