Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation
Introduction
Aux-Think uses Chain-of-Thought (CoT) reasoning only during training, so the model internalizes that reasoning and performs Vision-Language Navigation without generating explicit reasoning at inference. This keeps inference efficient while achieving strong performance with limited training data.
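The idea of reasoning at training time only can be illustrated with a toy objective. The sketch below is not the released training code: the function names (`training_loss`, `inference_step`), the `aux_weight` parameter, and the specific weighted-sum form of the loss are all assumptions made for illustration. It only shows the general pattern in which a CoT term contributes to the training loss while inference decodes an action directly.

```python
import math


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target under a probability vector."""
    return -math.log(probs[target_index])


def training_loss(action_probs, action_target,
                  cot_token_probs, cot_targets, aux_weight=0.5):
    """Hypothetical combined objective: action loss plus a weighted CoT loss.

    aux_weight is an illustrative hyperparameter, not a value from the paper.
    """
    action_loss = cross_entropy(action_probs, action_target)
    if cot_targets:
        # Average the per-token loss over the reasoning trace.
        cot_loss = sum(
            cross_entropy(p, t) for p, t in zip(cot_token_probs, cot_targets)
        ) / len(cot_targets)
    else:
        cot_loss = 0.0
    return action_loss + aux_weight * cot_loss


def inference_step(action_probs):
    """At inference, pick the action directly; no reasoning text is generated."""
    return max(range(len(action_probs)), key=action_probs.__getitem__)
```

In this sketch the CoT term appears only in `training_loss`; `inference_step` never touches reasoning tokens, which is where the inference-time savings would come from.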
Citation
@article{wang2025think,
  title={Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation},
  author={Wang, Shuo and Wang, Yongcai and Li, Wanting and Cai, Xudong and Wang, Yucheng and Chen, Maiyue and Wang, Kaihui and Su, Zhizhong and Li, Deying and Fan, Zhaoxin},
  journal={arXiv preprint arXiv:2505.11886},
  year={2025}
}