Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities Paper • 2507.13158 • Published Jul 17, 2025 • 24
What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization Paper • 2207.05161 • Published Jul 11, 2022 • 1
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples Paper • 2310.07747 • Published Oct 11, 2023 • 1
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond Paper • 2310.06147 • Published Oct 9, 2023 • 1