no code implementations • 29 Jan 2024 • Xiaobei Wang, Shuchang Liu, Xueliang Wang, Qingpeng Cai, Lantao Hu, Han Li, Peng Jiang, Kun Gai, Guangming Xie
Furthermore, we show that a reward-based future decomposition strategy can better express the item-wise future impact and improve the recommendation accuracy in the long term.
no code implementations • 4 Jun 2022 • Jianing Bai, Tianhao Zhang, Guangming Xie
In this paper, we explore the performance of multi-agent reinforcement learning-based cross-layer congestion control algorithms and present the cooperation performance of two agents, known as MACC (Multi-agent Congestion Control).
no code implementations • 28 Feb 2022 • Shuai Li, Chen Wang, Guangming Xie
We study pursuit-evasion differential games between a faster pursuer moving in 3D space and an evader moving in a plane.
no code implementations • 9 Mar 2021 • Tianhao Zhang, Yueheng Li, Shuai Li, Qiwei Ye, Chen Wang, Guangming Xie
In this paper, the circle formation control problem is addressed for a group of cooperative underactuated fish-like robots involving unknown nonlinear dynamics and disturbances.
no code implementations • 1 Jan 2021 • Yueheng Li, Tianhao Zhang, Chen Wang, Jinan Sun, Shikun Zhang, Guangming Xie
We explore energy-based solutions for cooperative multi-agent reinforcement learning (MARL) using the idea of function factorization in centralized training with decentralized execution (CTDE).
no code implementations • 23 Jun 2020 • Xingwen Zheng, Wei Wang, Liang Li, Guangming Xie
Then four typical regression methods, namely random forest, support vector regression, back-propagation neural network, and multiple linear regression, are used to establish regression models between the ALLS-measured HPVs and the relative states.