no code implementations • 18 Feb 2024 • Yasaman Jafari, Dheeraj Mekala, Rose Yu, Taylor Berg-Kirkpatrick
RL-based techniques can be used to search for prompts that when fed into a target language model maximize a set of user-specified reward functions.
no code implementations • 7 Dec 2020 • Yasaman Jafari, Nazanin Sabri, Behnam Bahrak
Sharing our goals with others and setting public challenges for ourselves is a topic that has been the center of many discussions.
Social and Information Networks