Search Results for author: Andreas Merentitis

A Bandit Framework for Optimal Selection of Reinforcement Learning Agents

This helps the bandit framework to select the best agents early, since these rewards are smoother and less sparse than the environment reward.

Paper
Add Code

This work explores maximum likelihood optimization of neural networks through hypernetworks.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.