On the Generalization of Training-based ChatGPT Detection Methods Paper • 2310.01307 • Published Oct 2, 2023
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists Paper • 2506.01241 • Published Jun 2 • 9