Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation

Introduction

Aux-Think internalizes Chain-of-Thought (CoT) only during training, enabling efficient Vision-Language Navigation without explicit reasoning at inference, and achieving strong performance with minimal data.

Citation

@article{wang2025think,
  title={Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation},
  author={Wang, Shuo and Wang, Yongcai and Li, Wanting and Cai, Xudong and Wang, Yucheng and Chen, Maiyue and Wang, Kaihui and Su, Zhizhong and Li, Deying and Fan, Zhaoxin},
  journal={arXiv preprint arXiv:2505.11886},
  year={2025}
}
Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support