no code implementations • 18 Sep 2018 • Izumi Karino, Kazutoshi Tanaka, Ryuma Niiyama, Yasuo Kuniyoshi
Moreover, this method switches isotropic exploration and directional exploration in parameter space with regard to obtained rewards.
OpenAI Gym reinforcement-learning +1