Cost-of-Pass: An Economic Framework for Evaluating Language Models Paper • 2504.13359 • Published Apr 17 • 5
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval Paper • 2310.15511 • Published Oct 24, 2023 • 5
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models Paper • 2309.15098 • Published Sep 26, 2023 • 7