DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3 • 3
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3 • 3
Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models Paper • 2505.17496 • Published May 23
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models Paper • 2507.15375 • Published Jul 21 • 25
Mitigating Object Hallucinations via Sentence-Level Early Intervention Paper • 2507.12455 • Published Jul 16 • 7
Einstein Fields: A Neural Perspective To Computational General Relativity Paper • 2507.11589 • Published Jul 15 • 7
Evaluations of Large Audio-Language Models (LALMs) Collection This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 1
Evaluations of Large Audio-Language Models (LALMs) Collection This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 1