view article Article You could have designed state of the art positional encoding Nov 25, 2024 โข 233
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper โข 2503.09516 โข Published Mar 12 โข 28
Learning to Learn Faster from Human Feedback with Language Model Predictive Control Paper โข 2402.11450 โข Published Feb 18, 2024 โข 23