no code implementations • NeurIPS 2011 • J. Z. Kolter
Off-policy learning, the ability of an agent to learn about a policy other than the one it is following, is a key element of Reinforcement Learning, and in recent years there has been much work on developing Temporal Difference (TD) algorithms that are guaranteed to converge under off-policy sampling.
no code implementations • NeurIPS 2010 • J. Z. Kolter, Siddharth Batra, Andrew Y. Ng
Energy disaggregation is the task of taking a whole-home energy signal and separating it into its component appliances.