no code implementations • 1 Sep 2023 • Wanyi Chen, Mary L. Cummings
Results revealed variability and inconsistencies in both the participants' and the LLMs' choices, especially when different criteria and metrics disagree.
Model Selection