1 code implementation • 12 Sep 2023 • Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu
Third, we train an Advantage-Conditioned Transformer (ACT) to generate actions conditioned on the estimated advantages.
Action Generation