no code implementations • 8 Dec 2021 • Liran Szlak, Kristoffer Aberg, Rony Paz
During probabilistic learning organisms often apply a sub-optimal "probability-matching" strategy, where selection rates match reward probabilities, rather than engaging in the optimal "maximization" strategy, where the option with the highest reward probability is always selected.