Shayegan Omidshafiei
Shayegan Omidshafiei
Підтверджена електронна адреса в google.com - Домашня сторінка
Назва
Посилання
Посилання
Рік
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
S Omidshafiei, J Pazis, C Amato, JP How, J Vian
Proceedings of the 34th International Conference on Machine Learning (ICML …, 2017
3402017
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
78*2019
α-rank: Multi-agent evaluation by evolution
S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ...
Scientific reports 9 (1), 1-29, 2019
642019
Learning to Teach in Cooperative Multiagent Reinforcement Learning
S Omidshafiei, DK Kim, M Liu, G Tesauro, M Riemer, C Amato, ...
AAAI 2019, Best Student Paper Honorable Mention, 2019
642019
Decentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions
S Omidshafiei, AA Agha–Mohammadi, C Amato, SY Liu, JP How, J Vian
The International Journal of Robotics Research (IJRR), 0278364917692864, 2017
572017
Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions
S Omidshafiei, A Agha-mohammadi, C Amato, JP How
IEEE International Conference on Robotics and Automation (ICRA), 5962-5969, 2015
552015
A generalized training approach for multiagent learning
P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ...
International Conference on Learning Representations (ICLR), 2020
352020
From Poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
262021
Neural replicator dynamics: Multiagent learning via hedging policy gradients
D Hennes, D Morrill, S Omidshafiei, R Munos, J Perolat, M Lanctot, ...
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
26*2020
Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions
M Liu, K Sivakumar, S Omidshafiei, C Amato, JP How
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017, 2017
262017
MAR-CPS: Measurable augmented reality for prototyping cyber-physical systems
S Omidshafiei, AA Agha-Mohammadi, YF Chen, NK Üre, JP How, JL Vian, ...
AIAA Infotech@ Aerospace, 0643, 2015
26*2015
Measurable augmented reality for prototyping cyberphysical systems: A robotics platform to aid the hardware prototyping and performance testing of algorithms
S Omidshafiei, AA Agha-Mohammadi, YF Chen, NK Ure, SY Liu, ...
IEEE Control Systems 36 (6), 65-87, 2016
232016
Multiagent evaluation under incomplete information
M Rowland, S Omidshafiei, K Tuyls, J Perolat, M Valko, G Piliouras, ...
arXiv preprint arXiv:1909.09849, 2019
222019
Graph-based Cross Entropy Method for Solving Multi-Robot Decentralized POMDPs
S Omidshafiei, A Agha-mohammadi, C Amato, SY Liu, JP How, J Vian
IEEE International Conference on Robotics and Automation (ICRA), 5395-5402, 2016
222016
Camera control for learning nonlinear target dynamics via Bayesian nonparametric Dirichlet-process Gaussian-process (DP-GP) models
H Wei, W Lu, P Zhu, S Ferrari, RH Klein, S Omidshafiei, JP How
2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 95-102, 2014
212014
Real World Games Look Like Spinning Tops
WM Czarnecki, G Gidel, B Tracey, K Tuyls, S Omidshafiei, D Balduzzi, ...
Conference on Neural Information Processing Systems (NeurIPS), 2020
182020
Information Value in Nonparametric Dirichlet-Process Gaussian-Process (DPGP) Mixture Models
H Wei, W Lu, P Zhu, S Ferrari, M Liu, RH Klein, S Omidshafiei, JP How
Automatica 74 (2016) 360–368, 2016
112016
Policy distillation and value matching in multiagent reinforcement learning
S Wadhwania, DK Kim, S Omidshafiei, JP How
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
102019
Learning hierarchical teaching policies for cooperative agents
DK Kim, M Liu, S Omidshafiei, S Lopez-Cot, M Riemer, G Habibi, ...
arXiv preprint arXiv:1903.03216, 2019
92019
Simultaneous mapping and planning by a robot
A Aghamohammadi, SD Spindola, BF Behabadi, C Lott, S Omidshafiei, ...
US Patent 10,093,021, 2018
92018
У даний момент система не може виконати операцію. Спробуйте пізніше.
Статті 1–20