-
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Paper • 2502.08946 • Published • 195 -
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts
Paper • 2508.09848 • Published • 65 -
ttchungc/PRELUDE
Viewer • Updated • 1.16k • 340 • 16 -
ShunchiZhang/PhysiCo
Viewer • Updated • 600 • 92 • 6
Mo
BishopGorov
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
PRELUDE: A Benchmark Designed to Require Global Comprehension and
Reasoning over Long Contexts
authored
a paper
2 days ago
ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long
Narrative Reasoning
liked
a dataset
3 days ago
ShunchiZhang/PhysiCo
Organizations
None yet