Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published Feb 23 • 13
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16
LLM-Reasoning (training) Collection LLM reasoning papers, with RL and long COT. (Post)Training of LLM is involved. • 6 items • Updated Feb 16
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Paper • 2207.14800 • Published Jul 29, 2022
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control Paper • 2307.00117 • Published Jun 30, 2023 • 6