Search Results for author: Alex Beeson

Found 2 papers, 1 papers with code

Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

no code implementations21 Nov 2022 Alex Beeson, Giovanni Montana

The ability to discover optimal behaviour from fixed data sets has the potential to transfer the successes of reinforcement learning (RL) to domains where data collection is acutely problematic.

Behavioural cloning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.