STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models • Paper • 2507.15375 • Published Jul 21 • 25
DeSTA2.5-Audio • Collection • 🔗 https://arxiv.org/abs/2507.02768 🔗 https://github.com/kehanlu/DeSTA2.5-Audio • 3 items • Updated Jul 21 • 1
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment • Paper • 2507.02768 • Published Jul 3 • 3
Audio-Aware Large Language Models as Judges for Speaking Styles • Paper • 2506.05984 • Published Jun 6 • 15
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks • Paper • 2411.05361 • Published Nov 8, 2024 • 1
Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data • Paper • 2409.20007 • Published Sep 30, 2024 • 1