no code implementations • 11 Jan 2024 • Zhuoyuan Mao, Yen Yu
This article introduces contrastive alignment instructions (AlignInstruct) to address two challenges in machine translation (MT) on large language models (LLMs).
1 code implementation • 5 Jun 2018 • Yen Yu, Acer Y. C. Chang, Ryota Kanai
This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account of the constructive interplay between boredom and curiosity, which gives rise to effective exploration and superior forward model learning.
2 code implementations • 22 Feb 2017 • Nicholas Guttenberg, Yen Yu, Ryota Kanai
In this method, the problem of action selection is reduced to one of gradient descent on the latent space of the generative model, with the model itself providing the means of evaluating outcomes and finding the gradient, much like how the reward network in Deep Q-Networks (DQN) provides gradient information for the action generator.
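The idea of selecting an action by gradient descent on a latent variable can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy linear `decode` function stands in for a trained generative model, the squared-error `loss` on a target outcome stands in for outcome evaluation, and the gradient is approximated by finite differences where a real implementation would more likely backpropagate through the network.

```python
def decode(z):
    """Hypothetical toy generative model: latent z -> (action, predicted outcome)."""
    action = 0.5 * z[0] - 0.25 * z[1]
    outcome = 2.0 * z[0] + 1.0 * z[1]
    return action, outcome


def loss(z, target):
    """Squared error between the model's predicted outcome and the desired one."""
    _, outcome = decode(z)
    return (outcome - target) ** 2


def grad(z, target, eps=1e-6):
    """Central finite-difference gradient of the loss with respect to z."""
    g = []
    for i in range(len(z)):
        z_plus = list(z)
        z_minus = list(z)
        z_plus[i] += eps
        z_minus[i] -= eps
        g.append((loss(z_plus, target) - loss(z_minus, target)) / (2 * eps))
    return g


def select_action(target, z0=(0.0, 0.0), lr=0.05, steps=200):
    """Descend the loss in latent space, then read the action off the decoder."""
    z = list(z0)
    for _ in range(steps):
        g = grad(z, target)
        z = [zi - lr * gi for zi, gi in zip(z, g)]
    action, outcome = decode(z)
    return action, outcome, z


if __name__ == "__main__":
    action, outcome, z = select_action(target=3.0)
    print("latent:", z)
    print("action:", action, "predicted outcome:", outcome)
```

In this sketch the model plays both roles described above: `decode` predicts the outcome used to score a latent point, and the same function is differentiated to find the descent direction. The action is a by-product of the optimized latent, not something searched over directly.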